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STATISTICS OF THE 1954 POLIO 
VACCINE TRIALS* 


K. A. BROWNLEE 
University of Chicago 


HE Report on the 1954 Poliomyelitis Vaccine Trial was produced by 
the Poliomyelitis Vaccine Evaluation Center at the University of 
Michigan, directed by Thomas Francis with Robert Korns as Deputy 
Director and Robert Voight as Chief of Statistical Operations. Appar- 
ently the Report is a joint product of the staff of the Evaluation Center. 
The Report was launched upon the world with a volume of publicity 
probably unprecedented for a scientific work. The Report was used by 
the National Foundation for Infantile Paralysis to push the vaccine in- 
to mass use in the spring of 1955. The fact that the vaccine then being 
pushed differed very importantly from that used in the 1954 trial, in 
that it did not contain merthiolate, was released to the public only 
after the fact that some of the vaccine was causing poliomyelitis could 
no longer be ignored. The ultimate horror came when the vaccine 
caused poliomyelitis not only in some of those injected but also in asso- 
ciates of those injected. The responsibility for these tragic events, of 
course, is not that of the authors of this Report: they were concerned 
solely with the evaluation of the vaccine used in the 1954 trial. 

It is impossible not to be impressed with the courage of those who 
undertook a task of this magnitude. The Report lists 312 State and 
Local Health Officials who participated in the field trials in the U. S. 
and 6 others in Canada and Finland. Listed also are 54 physical thera- 
pists, 22 epidemiological intelligence officers, 28 laboratories with their 
principal scientists, and the 17 members of the Advisory Committee. 
The latter, incidentally, included three who are listed in the 1954 Di- 

* An invited review article on Evaluation of 1954 Field Trial of Poliomyelitis Vaccine: Summary 


Report. Poliomyelitis Vaccine Evaluation Center. University of Michigan, Ann Arbor, Michigan. 
April 12, 1955. Pp. xiv, 51, Appendix pp. 63. No price. 
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rectory of the American Statistical Association. Further, several mem- 
bers of the staff of the U. S. Bureau of Census were on leave to the 
Evaluation Center, the machine tabulations were handled by the 
University of Michigan Tabulating Service, and the Survey Research 
Center of the Institute for Social Research of the University of Michi- 
gan participated. To the above must be added countless school teach- 
ers and physicians, and lastly 1,829,916 children. 

The role of the Advisory Committee is not explained. Was it merely 
to exist and lend weight by the authority of its names? Or was it to 
provide answers to specific questions when asked, and otherwise be 
silent? Or was it to keep a collective eye on the whole proceeding and 
satisfy itself that everything was according to Hoyle? 

The first 51 pages of the Report are divided into “Plan of Study” 
(14 pages), “Results” (44 pages), and “Summary of Estimates of Ef- 
fectiveness of Vaccine” (2 pages). The remaining 61 pages are an Ap- 
pendix consisting mainly of 18 tables and 11 pages of reproductions of 
forms. 

It is not completely clear whether the staff of the Evaluation Center 
were responsible for the design of the field trial as well as its analysis: 
on page xiii we find 

The proposal was made at that time [the autumn of 1953] by the Founda- 


tion that a Center, to conduct a scientific evaluation of the effectiveness of 
the vaccine, if any, be established .. . 


One would imagine that “to conduct a scientific evaluation” one would 
have to be satisfied that the design of the trial was satisfactory. How- 
ever, the only authority explicitly ascribed to the Center is for analysis: 


The conditions under which a Center would function needed to be de- 
fined. It was agreed that... the Center would function as the executive 
agency of the evaluation program, and that it alone would be responsible 
for the analysis of all data and for any reports. 


It is thus not clear, at least to this reviewer, whether being “the execu- 
tive agency for the evaluation program” included the authority for the 
design or merely carried the responsibility for executing someone else’s 
design. Some light may be thrown upon this question by the Public 
Health Service’s Technical Report on Salk Poliomyelitis Vaccine of June, 
1955, which states (page A4): 

August 1954. The National Foundation for Infantile Paralysis contracted 


with Dr. Thomas Francis, Jr., University of Michigan School of Public 
Health, for analysis of the data derived from the field trial. 
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It seems likely that this must constitute one of the largest and most 
expensive clinical trials ever run, and it received, or perhaps one should 
say obtained, many times the press publicity of any predecessor. The 
direct cost, the reviewer infers from newspaper reports, was of the order 
of $5,000,000. It is therefore somewhat upsetting to observe that the 
trial avoided complete disaster only by what appears to have been a 
late afterthought, too late, indeed, to save more than the lesser part: 

The plan of procedure announced by the National Foundation for In- 
fantile Paralysis and its Advisory Committee was to administer vaccine to 
children in the second grade of school; the corresponding first and third 
graders would not be inoculated but would be kept under observation for 
the occurrence of poliomyelitis in comparison with the inoculated second 
graders. This has been designated the “Observed Control” study. 


Apparently some one or more persons realized the total folly of this 
scheme and were able to do something about it: 

There was introduced, therefore, a second plan corresponding in pattern 
to that usually employed in scientific investigations. Children of the first, 
second, and third grades would be combined. One half would receive a solu- 
tion of similar appearance which should have no effect on immunity to 
poliemyelitis. Each child would receive the same lot of material, labeled 
only by code, for all three inoculations. Only the Evaluation Center would 
have the key. A single lot of each material was, so far as possible, to be used 
in a given area. The children in the study would be observed thereafter and 
all reports relating to a case of poliomyelitis would be made on a concealed 
or blindfold basis without knowledge of the nature of the inoculum. 


It is a pity that explicit credit is not given to whomever was responsible 
for this change. However, only 41 per cent of the trial was rescued and 
the remaining 59 per cent blundered along its stupid and futile path. 
This dichotomy of the trial persists in the Report: equal prominence is 
given throughout to data from both the “placebo areas” and “observed 
areas” and at times careful reading is necessary to be sure which is 
under discussion. For Table XI, for example, the reviewer is unable to 
determine whether the data refer to the placebo control areas or to the 
observed areas, but presumably they refer to both. 

The Report manifests an unwillingness to come out into the open 
and reject the possibility of valid scientific inferences as to the efficacy 
of the vaccine from the “observed areas.” What for example, is one to 
make of (page 5): 

In observed areas where only those second grade children whose parents 
requested participation were vaccinated, the problem of establishing the 
control population was more complex. 
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It is perfectly true to say that it is “more complex,” but to indulge in 

understatements of this order of magnitude is to be misleading. The 
plain fact is that it is impossible. Why not say so? Continuing: 

After careful consideration of various alternatives, it was decided that the 

total first and third grade study population compared to the vaccinated 


second grade population would be the most critical measure that could be 
applied to measure the efficacy of the vaccine. 


When the “most critical measure” is worthless, why allow the unin- 
formed reader to continue with the impression that it is worth some- 
thing? 

The failure to face up to the implications of the confounding of vac- 
cination status with age and other factors is manifested in the discus- 
sion of the data (page 27): 

Table 3b presents the same type of analysis of cases in the observed study 
areas. The significance of differences in rates in vaccinated and control 
populations in the major categories are also at the .001 significance level. 


... The difference between the attack rates for bulbo-spinal poliomyelitis 
in vaccinated and controls was significant at .01. 


The unqualified attribution of these “significance levels” to vaccina- 
-tion rather than to the complex of factors differentiating the two popu- 
lations must be condemned. 

There is no need, in the pages of this Journal, to discuss the reasons 
for the need for randomization in experiments of this type, but it is 
perhaps worthy of note that this Report contains two very good illus- 
trations of its necessity 

The first example, a subsidiary study of absenteeism in Schenec- 
tady, which was a placebo area, shows very clearly that children not 
requesting participation in the trial are quite different from those re- 
questing participation. The New York State Health Department in- 
vestigated 24 schools involving a total study population of 4,207 
children. The extent and causes of absenteeism for six weeks after in- 
oculation were obtained for the vaccinated group, the placebo group, 
and also for those children for whom participation had not been re- 
quested. Thirteen hundred and sixteen (1,316) vaccinated children 
had a total of 2,869 days absence and 1,304 placebo children had 2,988 
days absence. The total days of absence are broken down (in Table 
XVII) into 34 different categories, but since only child-days are given, 
ordinary contingency table techniques cannot be used on the data 
as given. There are no very obvious differences between these two, 
though if we remove the “nonmedical” absences (482 and 390 re- 
spectively) we get mean absences per child of 1.813 days and 1.992 
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days: the difference of 9.9 per cent seems, off-hand, to be a trifle large. 
However, the most striking feature of this table is that the 1,360 
children for whom participation was not requested had a total of only 
1,458 days absence, about half that of the participating group. We can 
make an approximate test of significance by considering the number of 
categories for which the mean for the participating groups exceeds the 
absences for the nonparticipators. If we restrict ourselves to those 24 
categories for which the total absence was 10 or more child-days, the 
participators exceed the nonparticipators in all but two categories. 
While these will not be completely independent, since it will be pos- 
sible for a particular child to be absent for more than one reason within 
a six week period, presumably this effect will be minor and there seems 
little doubt that the difference is genuine. Whether the reason is that 
the parents of the participators hold their children back from school 
unnecessarily or that the parents of the nonparticipating children send 
them to school when they should be held at home is an interesting 
subject for speculation. The Report comments (page 18) on this data: 
“Absenteeism in the noninoculated population did not differ signifi- 
cantly” (from the vaccinated and placebo populations). The report 
does not state what test was used: it is very difficult to believe, in the 
light of the foregoing discussion, that there is not a difference between 
the two populations. 

The second place where the data of the Report make plain the need 
for strict randomization is in Table 2b, where, in the placebo area, 
those receiving the placebo had a significantly higher rate (57 per 
100,000) of paralytic polio, than those not participating in the trial 
(36 per 100,000). A sample survey (page 13) established that the non- 
participants belonged to lower socio-economic groups than the partici- 
pants. 

These two results demonstrate the futility of the work on the 
“observed areas.” 

A considerable amount of the Report is taken up with the results of 
a scoring scheme for the degree of paralysis and with the results of 
attempts to isolate the virus from specimens taken from patients. From 
the layman’s point of view, probably the most interesting and convinc- 
ing set of data is that which refers to the poliomyelitis cases categorized 
as paralytic and not-paralytic and not-polio. Since the paralytic cases 
are those for which there is the least difficulty in diagnosis we are 
probably justified in having greatest confidence in that part of the 
data, though some may prefer to consider only those cases confirmed 
by the laboratory. 
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It might be noted, for the benefit of the unwary, that the formula on 
page 62, section 3b, for testing the difference between the rates in two 
groups, is valid only when, in the conventional notation for the four- 
fold table, a/(a+b) and c/(c+d) are small, that is, when the relevant 
binomial distributions are well approximated by Poisson distributions, 
In the specific cases where this formula is employed in this Report, of 
course, this is the case. This formula also contains the constant }, 
presumably a correction for continuity, but this should have a negative 
rather than a positive sign. 

The most convincing evidence for the effectiveness of the vaccine is 
contained in Table 2b. Confining our attention to paralytic polio, the 
vaccinated group had 33 cases in a population of 200,745, whereas the 
placebo group had 115 cases in 201,229. With the assumption that the 
two populations differ only in their vaccination status and that we have 
independent random sampling there can be little doubt as to the effec- 
tiveness of the vaccine. The corresponding rates per 100,000 are 16 and 
57. The “effectiveness” of the vaccine, as defined in the Report (page 
62) is 100(1—Ri/R.)%, where the R’s are the rates. The over-all ef- 
fectiveness is thus 71.9 per cent. 

It is of interest to see how far the vaccinated and placebo popula- 
tions are identical except for their vaccination status. The age distribu- 
tions are given in Table III and appear reasonably similar. Inciden- 
tally, in Table III, neither for the vaccinated nor for the (placebo) con- 
trol do the sums for the age groups add to the totals given. 

Another point for which a comparison between the vaccinated and 
placebo groups can be made is with regard to their initial antibody 
titers. It seems to have been the intention (page 3) to take blood sam- 
ples from a two per cent sample of the children, and actually a 3.6 per 
cent sample (14,475/401,974) was taken. Only a fraction of these were 
analyzed, however. 

Table XII gives serum antibody titers for paired sera where the 
first blood was less than 4 units, categorized by vaccine lot and vaccina- 
tion status. By “paired sera” is meant bloods from those children for 
whom both pre-inoculation and post-inoculation samples were avail- 
able. Here we are concerned only with the titer of the pre-inoculation 
sample. An antibody titer of less than 4 units implies that the child is 
presumably low in natural defenses to polio virus. Table XII includes 
results from nine lots of vaccine; however, one of them, lot number 
513, does not appear in Table V giving distribution of study cases of 
polio by vaccine lot, so this lot is omitted from further consideration 
here. The fteport gives no explanation for its omission. 
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In Table XII we have approximately equally sized samples for 
vaccinated and “not vaccinated,” which presumably means injected 
with placebo, since presumably blood could not be obtained from 
the nonparticipants. For example, for Lot 304 there were 472 sera 
from vaccinated subjects of which 109 had titers of less than 4 units 
for all three types of polio virus: for not-vaccinated subjects there were 
408 sera of which 113 were less than 4 units. We can calculate an ap- 
proximate unit normal deviate on the assumption of the normal ap- 
proximation to the binomial distribution. For the eight lots considered, 
in the order in which they are given in Table XIJ, these are 1.568, 
0.394, 2.154, 1.310, 1.283, —1.102, 0.423, and —0.815. If one forms an 
average, weighted in proportion to the square roots of the numbers of 
sera in the lots, one obtains 1.853, which has a single-tailed probability 
of 0.032. 

Also throwing light upon this question is the statement (page 23) : 


The differences in lots are sharply marked in those [children] without 
preceding antibody to any type. . . . In the placebo area 20.6 per cent of the 
vaccinated and 22.9 per cent of the placebo controls were in this class. 


Here the Report violates one of the rules of good writing, namely 
never to quote a percentage without reference to the sample size—we 
can do little more with these figures than to conclude that they lend 
some reinforcement to the above suspicion that the trial may be biased 
in favor of the vaccine by including more children with low antibodies, 
and hence higher likelihood of polio, in the placebo group. Alternatives 
to this somewhat drastic and damning conclusion might include: 

(a) The uneven distribution of those with poor natural antibodies 
has occurred solely through chance, and what we observe is the 0.032 
probability that will occur with this frequency. Furthermore, it might 
be hoped or claimed, the magnitude of the discrepancy is trivial and 
of no practical importance. 

(b) There is some lack of independence between individuals: we 
are not, perhaps, in the classical probability situation of sampling 
balls independently from an urn, and this lack of independence leads 
to a per cent points of statistics calculated on the assumption of inde- 
pendence occurring substantially more than a per cent of the time. 
If this is so, of course, it will probably also apply to the statistical 
analysis of the polio data. 

(c) There was some departure from pure randomization in the taking 
of blood samples. It is difficult to imagine that this would not also have 
affected the injections of vaccine and placebo. 
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(dq) The blood samples were taken strictly at random, and the 
vaccinations performed strictly at random, but in the selection of 
blood samples for analysis some bias operated in the placebo group 
in favor of selecting bloods of low titers and this bias did not operate, 
or at least did not operate to the same extent, in the vaccinated group. 
For example, if a child was more likely to develop polio if his initial 
antibodies were low, and if when a child developed polio a request 
might be filed through the appropriate channel for its pre-injection 
titer to be measured, and since more children in the placebo group 
developed polio or polio-resembling diseases we would expect more 
determinations to be made on the placebo group. However, in the 
placebo group, only 162 children developed polio or “not polio” and 
since the rate of taking pre-injection blood samples was 3.6 per cent 
the expected number of children in this class is 5.8. In the vaccinated 
group there were 82 children with polio or not polio, so we would ex- 
pect pre-injection blood to be available for 3.0 of them. Clearly this is 
not a mechanism which could account for the observed discrepancy. 

Pages 22-24 give interesting two-way tables for a number of lots 
showing serum antibody changes for Types I, II, and III virus between 
the pre-vaccination and post-vaccination blood samples. It is gratify- 
ing to find not merely the marginal distributions of antibody titers but 
also the data for each individual child before and after vaccination. 
On the basis of these data the Report categorizes lots as good, moderate, 
low moderate, and poor. A good lot, obviously, gives large numbers of 
children high antibody titers and a poor lot makes relatively little 
over-all change. Table V gives the breakdown of polio cases by lot 
arranged according to the quality of the lots. For paralytic polio, we 
can calculate the “effectiveness” as 74, 55, and 79 per cent for good, 
moderate, and low moderate and poor lots combined (the two latter 
are combined here as there were relatively few children with the one 
low moderate lot). It is rather curious that there is no evidence here 
that the “good” lots were any better than the “low moderate” and 
“poor” lots. 

Of particular interest in the light of the unfortunate experiences 
with the subsequent use of the vaccine in 1955 are the studies the 
Report makes of the safety of the vaccine with regard both to reac- 
tions of all types and specifically to polio. From the data in the Report 
(page 21) it seems clear that the vaccine as used was innocent in 1954. 
It appears from other sources, however, that to obtain nonlive vaccine 
was far from easy. The Public Health Service Technical Report of 
June, 1955, states (page 9): 
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In March 1954, soon after vaccine production for the field trial was 
started, live virus was detected by monkey and tissue culture tests in 4 out 
of 6 supposedly inactivated lots of vaccine. 


and further 


In a total of 11 positive lots the testing laboratories varied in their ability 
to detect the virus... 


It is rather curious that the Evaluation Report is completely devoid 
of any reference to these problems or to the methods of manufacture. 
In fact, the Report contains no references whatsoever to medical, 
immunological, or virological literature—the sole reference is to the 
paper “Statistical Methods for Poisson Processes” in the June, 1954, 
issue of this Journal. 

To summarize, 59 per cent of the trial was worthless because of the 
lack of adequate controls. The remaining 41 per cent may be all right 
but contains internal evidence of bias in favor of the vaccinated. There 
was hope that an independent trial would be run in Great Britain under 
the auspices of the Medical Research Council, but this has been 
abandoned since they concluded that the vaccine was too dangerous. 
The reviewer may seem too skeptical in feeling the need for an inde- 
pendent confirmation of a trial run on the scale of the present one, but 
he would point out that gamma globulin was triumphantly proclaimed 
effective by the National Foundation after a similar trial, but now con- 
siderable doubts exist as to the correctness of this conclusion. 










A STUDY OF INDUSTRIAL USE OF PROBABILITY 
STATISTICS IN THE PHYSICAL SCIENCES 


JuLIaAn H. Tovu.tovuse 


INTRODUCTION 






























N ATTEMPTING to canvas opinions as to the use of probability statistics 
{ in industry in the physical sciences, for a panel discussion at the 
Montreal Meeting of the American Statistical Association, September 
10,1954, we endeavored to develop a background by a survey or ques- 
tionnaire sent to men and women listed in two biographical publica- 
tions, and whose title or position indicated some contact with develop- 
ment, research, or project problems in industry. Some were listed as 
directors of research or development, while others were members of 
such groupings. In all, 440 questionnaires were sent out, which were 
reduced to 423 by some which were returned because of incorrect 
address. 


THE SAMPLE 


The names, themselves, were chosen by a stratified sampling, about 
equal in number from American Men of Science and Who’s Who in En- 
gineering. This was done to give representation from both the scien- 
tific and engineering approach to possible usage of statistics. A start- 
ing page was chosen by opening each book at random and using the 
last digit of the page number as a starting page at the beginning 
of the book. Beginning with the top, left name on that page, the entries 
were examined in turn until a name suitable for the intent of the survey 
was found. Only people associated with an industrial capacity, and in 
some manner with development and research, were chosen, thus elimi- 
nating those in teaching, institutional, governmental, and consulting 
capacities covered by others on the panel. Thereafter, the same proce- 
dure was followed every twelfth page in Who’s Who in Engineering, and 
every fifteenth page in American Men of Science, which would give 
something over 200 names each. These selected individuals were then 
sent a letter of explanation and a blank questionnaire. 





THE QUESTIONNAIRE 


The questionnaire was a return-postcard with the questions limited 
to what could be printed on one side. The answers were expected to 
be “yes” or “no” but qualified answers were often made. Not all ques- 
tions were always answered, but the response was generally satisfac- 
tory, since 63 per cent of them were returned. There were 119 replies 
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from those of a scientific background, and 134 from engineers. One 
feature was the fact that many felt so strongly on the subject that 
letters of explanation or additional comment often accompanied the 
returned questionnaire. 

The questionnaire was as follows: 











1. Do you use statistically designed research plans? 

2. Do you have a staff member trained in probability statistics? 

3. Does he take part in the planning of experiments? 

4. Do you base research decisions on probability analysis? 

5. Do you use the following: Never Some Much Always 


Distribution Analysis 
Control Charts 
Tests of Significance 
Correlations 
Analysis of Variance 
Latin or Other Squares 
Other Advanced Methods 
6. Do you use the above as much as you: could - 
like ? 
7. Have you a program to increase use of probability statistics? 












































, would 





8. If you do not use these methods, is it: cost , personnel 


, too new , too formal 








not applicable , too in- 


volved 
9. Do you wish a summary of the answers to this questionnaire? 








? 





Only one key was used in sending out the questionnaire. To scientists 
a three line return address was placed on the card, and to engineers 
an additional line carried this author’s engineering title. Thus the two 
fields could be kept separate as to answers. Also, no element of per- 
suasion was used. 

The questions were not only for factual answers, but also for inter- 
comparison purposes. Question 6 offered a measure of interest, by the 
contrast between the two sections, where the respondent might indi- 
cate he was using the method as much as he could (with budget, 
personnel and other limitations) but not as much as he would like. 
The answer to question 9 was an important gauge of their interest in 
the subject, far beyond the mere desire to see what was said. 
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Obviously, the answers to so many questions gave a wide degree 
of combination and inter-comparison. I have made no attempt to go 
beyond the relation between pairs of questions, and submit the com- 
parative answers as a general summary. Each returned questionnaire 
was copied on a McBee-Keysort card for facilitating means of analysis. 

In Table 1, across the top, appear the questions which could be 

answered “yes” or “no,” which means that questions 5 and 8 were 
omitted. Under each question number appears a word which should re- 
fresh the memory as to the complete question as outlined earlier in 
this report. Below each question number then appears on three lines 
the numbers of replies where the answer was “yes” (including qualified 
yes), or “no,” or where the space on the questionnaire was not filled 
out. The rest of the table needs an explanation, since the data have 
been summarized as briefly as possible. It has a line for each of the 
questions, with somewhat more of a refresher as to the context of the 
question, and the body of the table, except for question 5, gives the 
proportion of “yes” answers to the question on the line, divided as to 
whether the answer was “yes” or “no” to the question of the column. 

In other words, for example, what proportion of those answering “yes” 
to question 1 also said “yes” to question 2, and what proportion saying 
“no” to question 1, said “yes” to question 2, etc., etc. In each case 100 
minus the figure quoted is the proportion of “no” (or “not answered” 
in the case of question 9) answers. For example, the first pair of figures, 
under the column for question 1 appears on the line for question 2 at 
85/33. This means that of the 140 who answered question 1 as “yes,” 
meaning that they did use statistical methods, 85 per cent had a staff 
statistician. (100—85, or 15 per cent said no or did not answer.) This 
includes the qualified answers, most of which described the statistician 
as self-trained, or trained in short courses and the like. Of the 111 
who answered question 1 as “no,” 33 per cent did have a staff statis- 
tician, or a man with some knowledge of statistics—but evidently they 
did not use him! 

One can go through the entire table on this basis and make his own 
interpretation as to the meanings. I have made no attempt to test the 
significance of nearly similar answers. The most important differences 
are obvious, and a few will be pointed out. Take the answers in the 
column for question 1, on the line for 6c and 6w, the letters of which 
key to the words “can” and “would like” in the complete question. 
Of those who answered “yes” to question 1, 48 per cent believed they 
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were using these methods as much as they could. Later it will be seen 
that personnel more than anything else was the limitation. The answer 
of these same people to 6w gives the important fact—only one in 
seven (14 per cent) was satisfied with the extent of his use of statistics 
—siz out of seven wanted to use the statistical method more. It can be 
inferred that many limitations of budget, people, newness, and the 
like imposed the undesired restrictions. Those who did not use statisti- 
cal methods kept the reversed trend. Only one-quarter of those answer- 
ing “no” to question 1 were evidently using some form of statistics 
(there may be no real inconsistency because question 1 refers to re- 
search methods) as much as they could, but even more, 7 per cent were 
completely satisfied with the extent of such use. 

A bright spot for the side of the statistician is the fact that in answer- 
ing question 7, 56 per cent of those using statistics wanted to increase 
this use, but only 18 per cent of the non-users wanted to learn how to 
use statistical methods. These approximate ratios held for the first 
four questions, except that few (5 per cent) of the non-users were inter- 
ested in design applications. Further in column 1, the answers to ques- 
tion 8 may be considered. Multiple answers were possible here because 
more than one blank space could be checked off. There was a difference 
in that 57 of the 79 answering question 1 as “yes,” did not fill out 
question 8 at all, because of its phrasing, whereas only 11 of the 61 
who gave a qualified “yes” and 16 of the 111 who gave a “no” answer 
failed to fill out this reason for not being able to use statistics more. 


REASONS FOR LACK OF USE 


The 22 answering “yes” to question 1 and filling out one or more 
blanks in question 8 gave 43 answers, and these were combined with the 
50 who gave qualified answers to question 1 and gave 66 answers to 
question 8. The biggest hindrance was personnel, named 35 per cent of 
the time. Non-applicability was given 19 per cent of the time, and 
other answers as shown in column 1. Those who answered “no” to 
question 1 reversed the major reasons, naming non-applicability 38 
per cent, and personnel 25 per cent of the time. And while these ratios 
held closely true all across in the cost and personnel lines, it was not 
true of those who answered question 2, where personnel and non- 
applicability were named about equally. In other words, if they had a 
statistician they were less likely to name personnel, and if they did not 
have a statistician, there was more reason to name personnel as a de- 
termining reason. Hence, we might conclude that one feature in 
industry use is the lack of trained statisticians. 
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EXTENT OF USE 


Question 5 needs further explanation. The possible answers were 
“never,” “some,” “much” or “always.” These were weighted as 0, 10, 
25 and 50 respectively. All of the answers were averaged according to 
these weights and “scores” which might indicate the relative use of the 
methods were thus developed. (For example: Of 140 who answered 
question 1 as “yes” or “qualified yes”, 8 “never” used tests of signifi- 
cances, 68 “some,” 57 “much,” and 7 “always.” The score would be: 


[(8 x 0) + (68 X 10) + (57 X 25) + (7 X 50) 1/140 = 17.53 


This was recorded as “18” in the proper place.) The data in Table 1 
indicate how wide the difference is between users and non-users of the 
statistical method, but they also indicate that even the non-users have 
some familiarity. Note how in each case the degree of usage builds up 
to a high for tests of significance being less for both the more basic 
and the more complex. The drop-off for the more complex methods, 
however, is precipitous with the non-users. 


ENGINEERS VS. SCIENTISTS 


While many more inferences and interpretations can be gained from 
Table 1, we will leave this to others. Let us now briefly examine the 
answers from those primarily with a science background with those 
primarily from engineering, at least so far as their listing in the two 
biographical indexes is concerned. Actually, valid differences are hard 
to find. A few will be pointed out. 

Scientists seemed more reluctant to answer question 6c, since 32 
failed to answer, as compared with 2 engineers. About equal propor- 
tions answered this question “no.” The difference was therefore among 
those who felt that they used these methods as much as they could. 
Apparently more engineers felt that they had reached the limit of 
their physical restrictions or were more decisive in their opinions. 
On the other hand the engineers, answering question 6w, were more 
reluctant to commit themselves, although fewer felt that they were 
using these methods as much as they would like. Again differences 
appear in the methods used. The science group reached higher scores, 
indicating more use of the methods. This shows up most in the use of 
tests of significance. To some extent engineers only exceeded the 
scientists in the use of control charts, probably a direct carry-over 
from their use of control charts in production, and in use of correla- 
tions which again, may understandably be a feature of engineering 
practice. 
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TABLE 1 
SUMMARY OF 253 REPLIES FROM 423 QUESTIONNAIRES SENT OUT 








Question Number 





3 4 6c 6w 7 
Ana- Pro- 
Design| lyze | Could | Like | gram 





Number of Replies to the Abeve 





156 169 84 28 96 
96 75 135 140 136 
1 9 34 85 21 








Percentage answers “yes” after yes/no above 





. Do you use? — 75/25 | 87/17 | 78/8 | 48/64 | 68/61 | 80/40 | 62/35 
. Have a statistician? 85/33 _— 98/49 | 75/35 63/64 | 71/64 | 85/46 | 67/46 
. Use him in design? 84/14 | 84/1 — 57/13 | 60/55 | 64/53 | 81/35 | 57/38 
. Analyze results? 94/34 | 80/45 | 91/35 a 57/76 | 71/76 | 86/57 | 74/46 
. Use as much as: 
c. you can? 48/24 | 28/18 | 24/20 | 21/23 -- 43/10 | 26/24 | 42/47 
w. would like? 14/7 14/7 14/29 | 14/8 | 20/44 — 7/15 | 10/44 
. Program to increase? 56/18 | 53/12 | 59/5 49/16 | 33/45 | 18/53 — 47/9 


(Multiple answers possible—Per cent is of total answers) 





8. What hinders? 
12/8 9/8 11/6 13/5 11/9 11/12 | 15/7 12/5 
35/28 | 23/36 | 32/30 | 36/28 | 25/36 | 21/40 | 34/31 | 32/21 
19/38 | 22/30 | 24/33 | 20/46 | 39/22 | 32/15 | 18/34 | 26/46 
13/9 10/9 14/10 | 11/8 7/14 | 16/12 | 19/8 11/13 
6/3 4/4 4/6 6/1 5/4 0/6 5/4 5/4 
15/13 | 11/13 | 16/15 | 15/12 | 18/14 | 21/14 | 11/15 | 16/11 























84/36 | 81/34 | 80/38 | 83/43 | 68/18 | 14/12 | 94/37 _ 


Scores after yes/no above 





(Weighted: None =0, Some = 10, Much =25, Always =50) 
12/5 11/5 12/5 11/3 9/8 12/9 12/7 10/5 
12/7 12/7 12/7 12/6 10/15 | 14/10 | 13/8 11/8 
Significance 18/7 16/8 17/9 17/5 12/14 | 17/138 | 17/10 | 9/9 
Correlations 16/6 14/8 15/8 14/6 13/12 | 15/13 | 15/10 | 13/8 
Analysis of Var 14/4 12/5 14/5 13/3 8/11 | 11/11 | 14/7 11/7 
Latin Square 9/1 8/3 9/2 8/2 4/7 7/6 8/4 6/4 
7/1 6/1 7/1 6/1 4/5 6/5 8/2 5/3 





























'* Answer considered “no” if not checked in space provided. 


QUALIFIED AND EXTENDED ANSWERS 


Beyond the answers to the questions, the letters which accompanied 
them were of interest. Many were in explanation of the answers, and 
served to provide the qualification used in our interpretation. Many 
more were apologetic in the case of the “no” answers. Some of the 
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answers are quoted here because of their indications of usage and 
thinking. In some cases two or more excerpts appear from a single 
letter. The answers are grouped according to kind as shown by the sub- 
headings that follow. Note the contradictions: 

Use. “Our primary use . . . in Production Process Control.” “In this 
field (experimental engineering) we very definitely make use of statis- 
tics and mathematics of probability.” “We do make extensive use . . . 
developing the broad programs for extension of the system. Another 
area is... analysis of rate schedules. Another . . . reserve capacity.” 
“On some jobs very useful in establishing facts.” “Use only in produc- 
tion.” “Used in production but not in research.” “In this field (experi- 
mental engineering) we very definitely make use of statistics and 
mathematics of probability.” “We are interpreting in the broad sense 
including industrial and manufacturing research rather than just 
fundamental research.” “Multiple curvilinear graphical correlation 
always used.” “I use where possible... this is not often.” “I con- 
stantly have occasion to use these methods.” 

Programs. “Our programs...one of education and developing 
the basic concepts with the control and research groups.” “Forty-five 
people in research and control enrolled in a course.” “Self-teaching is 
not a satisfactory method.” “We have trained a considerable number of 
our staff.” “No specialists. Eight men with limited training.” “Some of 
our group have had preliminary courses in statistics and have done 
additional reading on their own.” “Are training... also have con- 
sultants.” “Several staff members have a good working knowledge.” 

Interest. “All of us in the technical and scientific (staff) are fully 
sold on the principles.” “In our research department we are at least 
conscious of the statistical methods.” “We are very cognizant of the 
matter and have talked over some of the applications.” “Lack of ap- 
preciation ... various numbers suggest an experimental design... 
since the persons primarily concerned with the program do not have 
an appreciation ... they carry out their programs by changing one 
variable at a time.” “A lot depends on the basic viewpoint . . . ranges 
from having nothing to do with it, as too long-haired, up to a more 
rational view.” “Personally, I am interested in these techniques, but 
find other personnel untrained and uninterested.” 

Results. “The jump from rule of thumb . . . all the way over... is 
extremely great ... end up with some sort of practical compromise.” 
“The use... (is) .. . practically nil. This is certainly not as it should 
be because we are primarily a research and development organization.” 
“Some research men use the statistical approach in drawing conclusions 
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but conceal the fact they have done so, or soft pedal it, in writing their 
reports.” 

The statistician. “We do not have a man specially trained (MS or 
PhD) in statistics . . . three or four of staff studying.” “In relatively 
few cases does the statistician have full responsibility . . . in most cases 
trying to work from the restricted data . . . at considerable sacrifice 
to the scientific niceties.” 

Objections. “Methods of interpreting results appear to be quite in- 
volved and time consuming.” “Statistical people make their knowledge 
seem very mysterious and lofty.” “You are probably aware that a 
skilled person can draw approximately the same straight line through 
a group of points as would be determined by the method of least 
squares.” “He may (apply statistics) on his work but not on mine!” “We 
are civil engineers . . . no need for statistical analysis.” “Not applica- 
ble in automotive research.” “But not for research planning.” “Prob- 
ability analysis simply not applicable.” 

Limitations. “We use these methods, but not as much as we would 
like because personnel is not available” “Basically ...no,... be- 
cause of lack of familiarity with its use.” “Higher skilled personnel 
probably would increase our use.” “We are checking this approach 
currently.” “In a few research studies.” “Would like to use more, but 
higher executives need to be educated.” 

Summarizing, we believe that the use of statistical methods is 
progressing more rapidly than most of us were aware. Interest is great 
even among those not yet “sold” on the idea. The younger engineer or 
scientist is often more aware than his superior, probably because of 
more recent schooling and contacts. Personnel is the greatest limiting 
factor where the use of the method has started. Many of those not 
using the method do not yet realize the applicability. Cost is not con- 
sidered a factor, but some deterrent is found in the formality and in- 
volvement of the methods. Generally, desire to use the methods out- 
weighs ability or limitations. Even among those not using the methods, 
36 per cent of the respondents wanted to know what answers others 
gave, while users wanted answers to the tune of 84 per cent. This aver- 
ages 66 per cent, and about the same number (63 per cent) took the 
time to answer the questionnaire at all. 

Industry unquestionably is “aware” of the statistical method of 
approaching a decision. 





TIME SERIES PROBLEMS IN AERONAUTICS* 


Harry Press 
National Advisory Committee for Aeronautics 


This paper reviews some recent applications of random proc- 
ess theory to problems in aeronautical engineering. A number 
of random type disturbances that cause aeronautical problems 
are described and some of their effects on the airplane indi- 
cated. Particular attention is given to the effects of atmos- 
pheric turbulence in giving rise to airplane loads and structural 
stresses. The general approach used in calculating airplane 
responses to random disturbances by means of power spectra 
techniques is then described. As a concrete illustration, some 
results obtained in a study of missile behavior in rough-air 
flight are presented. An effort is also made to indicate some 
unsolved statistical problems that are encountered in these 
applications. 


INTRODUCTION 


M“x problems in the physical and engineering sciences are con- 
cerned with the response of a dynamic system such as an air- 
plane, a ship, or an electronic circuit to an irregular or random dis- 


turbance in time. This paper is concerned with such problems in 
aeronautics. In the study of these problems, the statistical techniques 
of random process theory are beginning to play an important role. A 
list of references of recent work in this field is given at the conclusion 
of this paper along with a few references to the statistical literature. 
It is the purpose of this paper to describe some aspects of these applica- 
tions of random process theory. It is hoped that the paper will serve 
to stimulate interest among statisticians in random process theory 
and encourage the development of the theory and the solution of sta- 
tistical problems arising in applications. 

In aeronautics, a number of continuous random-type disturbances 
occur and affect the safety and performance of the aircraft. Examples 
include: 


(a) atmospheric turbulence 

(b) fluctuations in the aerodynamic forces associated with the phe- 
nomenon of buffeting 

(c) roughness of runway surfaces (both land and water) 
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(d) noise in electronic systems used for automatic pilots, guidance, 
and gunnery. 


The design of aircraft for safety, adequate performance, and successful 
operations requires the determination and the control of the airplane 
behavior under the influence of these disturbances. 


SOME PROBLEMS INVOLVING CONTINUOUS RANDOM DISTURBANCES 


Atmospheric turbulence. In the most general form, the problem of an 
airplane in flight through rough air can be considered one of a complex 
dynamic system under the influence of a four-dimensional random 
velocity vector disturbance; that is, three space dimensions and a time 
dimension. The complete airplane response in time involves displace- 
ments and rotations in regard to three space axes as well as flexural 
responses of the airplane structure itself. So far, only limited approaches 
to this problem have been attempted. As a simple example, the effect 
of only the vertical components of the turbulence on airplane wing 
load is iliustrated in Fig. 1. This is one of the more important aspects 
of the turbulence problem. 

As seen in Fig. 1, the airplane is in flight with the forward velocity V. 


GUST LOADS 
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Fig. 1. The response of an airplane to vertical gusts. 
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When the airplane encounters a vertical gust of velocity W, the air. 
plane effectively attains an angle-of-attack change Aa. For smal] 
angles, Aa is approximately equal to w/V. This angle-of-attack change 
in turn results in a roughly proportional increment of load on the wing 
a resultant vertical acceleration of the airplane, and associated stress 
increments in the structural members. Atmospheric turbulence, jin 
general, is characterized by a continuous variation in space of the air 
velocity. A representative variation of vertical gust velocity in space 
as measured by an airplane in flight is shown on the lower part of 
Fig. 1. These disturbances, as indicated for example by the results 
reported in [1], [3], and [16] appear to exhibit the principal characteris- 
tics of a random process and locally appear to approximate stationary 
Gaussian stochastic processes. 

An airplane in flight through a gust disturbance of this type will in 
turn experience sizeable incremental aerodynamic wing loads such as 
shown by the lower curve. In addition to the vertical gust disturbance, 
simultaneous lateral and longitudinal velocity fluctuations will also be 
present and give rise to associated disturbances in the vertical motions 
as well as in the lateral and longitudinal motions of the airplane. These 
loads and motions affect the structural integrity of the airplane and 
may induce failure either due to overstressing or through fatigue. They 
may also seriously affect the stability and control of the airplane and 
its suitability as a bombing or gunnery platform. 

Buffeting. “Buffeting” is an aerodynamic phenomenon and, as illus- 
trated by the sketch in Fig. 2, also involves turbulent air motions. 
In this case, however, the turbulent motions arise from the breakdown 
in the smooth flow over the airplane surface. These turbulent motions 
in turn produce irregular and erratic aerodynamic forces on the air- 
plane such as indicated by the traces for the differential pressure be- 
tween the upper and lower wing surfaces at two points on the wing 
“a” and “b.” These local pressure fluctuations are related in a complex 
fashion with pressure disturbances moving down and across the wing 
and undergoing continual change. Some indication of these relations 
can be seen from the sketches on Fig. 2 where pronounced bumps may 
sometimes be seen to move progressively toward the trailing edge. 

These fluctuations in pressure give rise to sizeable increments of wing 
load and structural stresses and also affect the controllability of the air- 
plane. An important phase of this problem is concerned with the defi- 
nition of the characteristics of the pressure disturbances and their 
relation to the physical parameters involved, such as the airplane 
speed, angle of attack, and the air density. This definition would pre- 
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Fic. 2. The phenomenon of airplane buffeting. 


sumably then form the basis for the airplane response determination. 

Runway roughness. The irregularities in runway surfaces affect the 
airplane in taxiing and during the take-off and landing operation. As 
shown in Fig. 3, the detail surface of a runway is not smooth, but ir- 
regular, and contains erratic variation in height. The relative height 
of the runway irregularities in this case has been greatly magnified for 
illustrative purposes. Generally, the height of these irregularities is 
only a few inches. With the higher speeds of modern aircraft in ground 
operations, these small disturbances in the runway may give rise to 
large airplane normal (vertical) accelerations and in some cases appear 
to have resulted in structural failures of aircraft components through 
the excitation of structural vibrations during taxiing, landing, and 
take-off operations. Similar difficulties are also encountered in seaplane 
operations in rough water. 


AIRPLANE STRUCTURAL STRENGTH 


One of the important effects of the foregoing disturbances on the 
airplane is to give rise to stresses in the structural members. In order 
to indicate the significance of these stresses to the structural integrity 
of the airplane, Fig. 4 illustrates the relation between the structural 
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Fia. 3. The problem of an airplane landing on or 
taking off from a rough runway. 
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Fia. 4. Comparison of fatigue strength and 
stress history in operations. 
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strength of the airplane and the stresses experienced in operations. 
The solid curve shown is the so-called S-N curve which defines the 
structural strength of a material or an airplane component in terms 
of the number of cycles of stress required for failure for various levels 
of stress amplitude. In the figure, one stress cycle at the highest level 
causes a failure by exceeding the static material strength. Also, re- 
peated stresses at lower levels of stress can likewise cause a failure, a 
so-called fatigue failure. 

The stress history experienced by an airplane arises from a number of 
causes. Rough air is, in many cases, the most important source of these 
stresses. These stresses may be represented on the same type of plot 
and are shown for a given operation by the dashed line. The dashed 
line actually represents a frequency distribution of stresses and defines 
the number of stresses at the various stress amplitudes. The exact rela- 
tion between a random stress history and fatigue life is not known and 
is an important structural problem. However, engineering rules involv- 
ing concepts of cumulative damage are frequently used to estimate 
fatigue life from such comparisons. With continued airplane operation, 
this dashed line approaches the S-N curve at either the upper or 
lower level and may give rise to a structural failure by overstress or 
by fatigue. For this reason, the prediction of the stress history is re- 
quired for design purposes and, specifically, such quantities as the 
largest stress and the number of stress oscillations at the various stress 
levels are of concern. The problems involved in calculating such quanti- 
ties are considered in the next section. 

RESPONSE CALCULATIONS 

The aeronautical problems considered in the foregoing discussion, 
in general, have the following common components: an erratic dis- 
turbance, an airplane dynamic system, and the airplane response to the 
random disturbance. The general approach to response calculations of 
this type is outlined in this section. For simplicity, the discussion will 
be largely confined to the case of a linear system and Gaussian dis- 
turbances. For a linear system, the response at a given time y(t) to an 
arbitrary disturbance starting from zero at t=0 is given by the follow- 
ing relation 


y(t) = f x(t) A(t — t)dh (1) 


where x(t) is the disturbance and A(t) is the response to a unit pulse 
at time 0 and is a form of weighting function. If z(t) and A(é) are ex- 
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plicitly known, the response may be calculated as a function of time 
and the characteristics of the response of concern examined in detail. 
This is the classical approach to the problem of response calculations 
for arbitrary but known disturbances. 

For random disturbances, the time function is not expressible jn 
simple explicit form. However, if the process is stationary, it may be 


described by an invariant property, the autocorrelation function R(r) 
defined as 


a. or 
R(r) = in oT _ on + r)dt (2) 


For the special case of a Gaussian disturbance this function completely 
describes the process. 

Another, and for many purposes more useful, representation of the 
random disturbance is given by the Fourier transform of the auto- 
correlation function, the power spectral density function (w). The 
autocorrelation and power spectral density functions are reciprocally 
related in the following manner: 


O(w) = = f "Re cos wrdr 


R(r) = f % cos wrdw 


It is apparent from equation (3) that 


R(o) = f "8(w)de (4) 


where it is assumed that z(¢) =0. Thus, the power spectrum can be seen 
to represent a frequency analysis of the variance withYthe element 
(w)dw defining the contribution to the variance of the components of 
z(t) having frequencies between w and w+dw. 

For a linear system, a useful and simple relation exists between the 
spectrum of a disturbance ®,(w) and the spectrum of the system re- 
sponse to the disturbance #,(w). This relation is given by 


,(w) = .(w)| T(iw)|? (5) 


where T(iw) is the system frequency response function and describes 
the system response to sinusoidal disturbances of unit amplitude at the 
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yarious frequencies. The bars designate the absolute value of the com- 
plex quantity. The frequency response function T'(iw) is also simply 
related to the unit impulse response function A(t) by the following 
relation 


T (tw) = f “ A()e-#'dt (6) 


Equation (5) is the basic relation between the input and output 
spectra and has been used widely in the calculation of system responses 
as well as for the determination of input spectra from measurements of 
system responses to random disturbances. For the case of a Gaussian 
disturbance and a linear system, the response is likewise Gaussian and 
is thus completely specified by the spectrum or its Fourier transform 
the autocorrelation function. The amplitude distribution in this case 
depends only on the variance. 

Additional relations between the power spectrum and the time- 
history characteristics have been investigated in an extensive study by 
§. O. Rice [22] and by others. For the special case in which the dis- 
turbance is Gaussian, Rice has derived a number of relations which 
appear useful in aeronautical applications and are particularly signifi- 
cant for fatigue studies. Of these, the more important appear to be 
expressions for the average number of times per second, N.(y), that a 
given value of y is crossed with positive slope and the average number 
of peak values (maxima) per second, N,(y), that are above a given 
value of y. The average number of times per second that a given 
positive value of y is crossed with positive slope (or a given negative 
value is crossed with negative slope) is given by 


1 o 29,2 
Ney) = — — etm (7) 

2r o 
where o; is the root mean square value of y’(t), the time derivative of 
y(t), and is related to the spectrum of y(é) by the relation 


n= Vivo; = | J “w8(o)de] (8) 


The expressions for the number of peaks per second that are above a 
given value of y (or the average number of minima or “valleys” per 
second that are below —y) is somewhat complicated but for large 
values of y 


N,(y) = N-(y) (9) 
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For most practical cases, it appears that this approximation is quite 
good for y>2c. The total number of peak (maxima) values of y (or 
the number of minima) which includes the maxima for positive and for 
negative values of y, is given simply by 















(10) 





. -) 1/2 
o2 = Vi") = lf ut (0)de (11) 


and y’’() is the second derivative of y(t) with respect to ¢. It may be 
remarked that, at least for some cases, the loads experienced by an 
airplane appear to follow a simple Gaussian distribution. (See, for 
example, [15] and [17].) The foregoing relations between the spectrum 
and the peaks, equations (7) to (11), which apply to Gaussian dis- 
turbances have been tested empirically in several aeronautical applica- 
tions (for example, [17]), and have been found to apply reasonably well. 

It is worth noting that the response of a system to several simul- 
taneous random disturbances such as vertical and longitudinal turbu- 
lence may also be handled in an analogous manner to that for the single 
disturbance. For the case of two disturbances, the response y(t) is 
given by 



































y(t) = f ama = ti )dt, +f euart —_ t)dty (12) 





where 2;(#) and 22(¢) are the two disturbances and A;,(¢) and A,(¢) are 
the unit impulse responses of the system to the respective disturbances. 
In terms of the power spectra, the output is then given by 


#,(w) = &2,(w)| Ti(iw)|* + &2,()| T2(éw)|?* 
+ $,,2,71* (tw) T2(iw) + ®,,2, 7 1(iw) T2* (tw) 
where ®,, and ®,, are the power spectra of x(t) and z(t), Ti(iw) and 
T:(iw) are the system frequency response functions for the respective 


disturbances (the asterisk designates the complex conjugate), and 
$,,., and ®,,,, are the cross-spectra defined by the following relations: 














(13) 




















1 re ) 
alll —J Rayz,(t)@-**dr (14) 





where 





T 


1 
R,,2,(7) = in or x1 (t)x2(t + 7)dt (15) 
T 





—>o 








Ti 


th 


TIME SERIES IN AERONAUTICS 1031 


The cross-spectrum in contrast to the power spectrum is complex with 
the real part providing a measure of the in-phase power sometimes 
called the co-power and the imaginary part providing a measure of the 
out-of-phase power sometimes called the quadrature power. 

Equation (13) may be used directly to determine the spectrum of the 
output to two random disturbances. For purposes of determining the 
spectra of the input disturbances from measurements of the output 
response, measurements are required, in this case, of two different re- 
sponses. Such applications have, for example, been made in aeronautics 
in the determination of two (or more) components of turbulence affect- 
ing an airplane in flight (see, for example, [16)). 

The preceding discussion has been confined to the simplest case of 
Gaussian processes. For non-Gaussian processes, the input-output rela- 
tions of equations (5) and (13) still apply. However, in these cases, the 
variance spectrum does not provide all the information on the process 
that is required and recourse appears necessary to higher order spectra 
and co-variance functions. Theoretical relations for the determination 
and application of such higher order spectra and co-variance functions 
are, for example, given in [11]. However, because of their complexity, 
no numerical applications of these relations have as yet been made. 


INPUT SPECTRA 


From the foregoing discussion, it is apparent that output calculations 
require a knowledge of the input disturbance spectrum ,(w) and the 
system frequency response function T(iw). The determination of the 
frequency response function for a particular airplane is an engineering 
problem and can usually be obtained from theory by the solution of the 
equations of motion. In many cases, experimental methods of determin- 
ing T(iw) are also available. The input spectrum usually has to be de- 
termined experimentally. As an example of the results obtained in ex- 
perimental measurements of input disturbances, some measurements 
of the spectrum of the atmospheric turbulence will be described. Flight 
measurements of turbulence spectra have recently been made using 
airplanes and some of these results obtained from various sources are 
summarized in Fig. 5. 

The ordinate of Fig. 5 is the power density, the abscissa is frequency 
in radians per foot. The frequency argument 2=27/), where J is the 
gust wavelength, is useful for airplane studies since insofar as the air- 
plane is concerned, turbulence is essentially a spacial phenomena. The 
data shown, in most cases, are for the vertical component of the turbu- 
lence. In one case, marked by the squares, the data are for the horizon- 
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Fia. 5. Measured power spectra of atmospheric turbulence. 








tal component or longitudinal turbulence. The data cover a range of 
gust wavelengths of from 10 to 3000 feet, which is the region that is 
of most concern for many airplane problems. The spectra differ in in- 
tensity as indicated by the height of the spectra reflecting the variation 
of turbulence intensity with the weather conditions. The spectra, in 
each case, show rapid decreases in power with increasing frequency. In 
fact, the spectra vary roughly as 1/0, a result in general agreement 
with theoretical results obtained in the theory of isotropic turbulence. 
The general shape of these spectra, with rapid decreases in power at the 
higher frequencies, has led to some important implications in regard to 
the design of airplanes for smoother flight. 

For design purposes, the over-all gust experience in operations is of 
concern. Presumably, the over-all experience consists of various expo- 
sure times to each of the spectra of Fig. 5 as well as for other spectra 
of turbulence associated with different weather conditions. These 
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spectra measurements have suggested that, as a first approximation, 
the spectral shape might be considered fixed and only the intensity or 
root-mean-square gust velocity considered to vary. Thus, the over-all 
gust experience may be considered a single parameter process where the 
parameter is the root-mean-square gust velocity. Operational data on 
gust loads have been used to estimate the probability distribution of 
this parameter, the root-mean-square gust velocity for the atmosphere. 
These results provide a basis for the calculation of the airplane gust 
response history in operations. 

Spectra measurements are also being made in regard to other aero- 
nautical problems. The roughness of runways and stormy seas and the 
characteristics of electronic noise are a few examples in which spectra 
measurements are being made for purposes of design study. 


AN ILLUSTRATIVE APPLICATION 


In order to provide a concrete illustration of the applications of time 
series techniques to a particular aeronautical problem, some results 
obtained in a recent study [17], will be described. The study to be de- 
scribed is concerned with the behavior and loads of a high-speed missile 
in rough-air flight. It was undertaken in an effort to verify the results 
of theoretical studies which indicated that, for the particular test con- 
ditions, missile stability changes associated with increasing forward 
speed would give rise to large increases in missile loads. 

The missile was a tailless missile and was fired at the NACA missile 
test center at Wallops Island, Virginia. The total missile flight lasted 
only about 20 seconds and yielded 2 seconds of usable record at each of 
eight test speeds. Inasmuch as the samples that could be obtained were 
necessarily small, there was some doubt that statistically significant 
test results could be obtained with this technique. Fig. 6 shows time 
histories of the missile normal (vertical) acceleration at two of the test 
Mach numbers which correspond to airspeeds of roughly 750 miles per 
hour and 600 miles per hour. Theory had predicted pronounced differ- 
ences in loads for these two test conditions and this can be seen qualita- 
tively to be the case from the relative magnitude of the fluctuations. 
The missile vertical acceleration in both cases is seen to be somewhat 
irregular although the predominating motion occurs at a frequency of 
roughly 10 cycles per second. The frequency characteristics of the rec- 
ords will be considered in more detail in connection with the power 
spectra. 

The distributions of the acceleration amplitude were determined 
from the records (readings at 0.01 second) and are shown in Fig. 7 for 
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Fra. 6. Time history of missile normal acceleration in 
rough air at two speeds. 


some of the test airspeeds. The figure shows the frequency distributions 
for several test speeds or Mach numbers. The dashed lines are fitted 
normal distributions. The reasonable fit of the normal distribution 
curves suggests that the process may be considered to have a Gaussian 
amplitude distribution. The application of chi-square tests further 
tended to support this hypothesis. 

In Fig. 8, some of the expected and measured power spectra are 
compared. The calculated spectra for three of the test speeds are shown 
on the upper left and are based on airplane measurements of the turbu- 
lence over the test course and missile response theory. The speeds 
covered here represent a speed increase of about 10 per cent from V; 
to V3. These results, as can be seen from the relative areas or the RMS 
values shown, suggest rather rapid increases in power for the speed 
range covered. Also, the spectra move progressively to higher fre- 
quencies with increasing speed and become more peaked as a result of 
reduced damping. The measured spectra are shown on the lower right 
and were obtained by using the estimations techniques developed by 
John Tukey, [24]. The estimates shown are rather crude in two respects. 
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Fie. 8. Comparison of calculated and measured power spectra. 
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First, due to the small samples available, roughly 2 seconds for each 
spectrum, it was necessary to estimate average power over rather wide 
frequency band widths. In these cases, each point represents an average 
over roughly +1.5 cps about the values plotted. Secondly, the small] 
number of degrees of freedom for each estimate give rather poor statisti- 
cal reliability for the individual estimates. The root-mean-square 
values are also shown and under the circumstances are considered in 
reasonable agreement with those calculated. 

In order to compare the calculated results with those measured, the 
calculated spectra were averaged over frequency bands in a roughly 
equivalent manner to the averaging contained in the spectra measure- 
ments and the results obtained are shown on the lower left of Fig. 8. 
The calculated results with this modification are seen to be in good 
agreement with those measured and reflect the rather rapid increases 
in total power with speed, the progressive movement of the spectra to 
higher frequency, and the increasing peakedness of the spectra. These 
results thus appear to substantiate the theoretical expectations and 
have also served to establish the feasibility of the testing and analysis 
technique in spite of the limitations on sample size. 

In order to test the reliability of theoretical results for the simple 
Gaussian case for the determination from the power spectrum of the 
number of peak amplitudes exceeding given values, a comparison of the 
number of accelerations counted from the test records with the number 
calculated from the measured spectra was made. Some of the results are 
summarized in Fig. 9. The circled points in each case represent the total 
number of peak accelerations (maxima and minima) obtained from the 
records. The dashed curves represent the expected number and were 
calculated from the measured spectra using equations 7 to 9. The good 
agreement obtained here and in other such studies has indicated the 
feasibility of calculating these quantities by spectral techniques. 


CONCLUSION 


The foregoing discussion has served to describe some of the applica- 
tions of random process theory to aeronautical problems. The concepts 
and techniques of random process theory are finding increasing use in 
the aeronautical sciences. In many of these problems, additional de- 
velopments in statistical techniques are needed to treat the problems 
adequately. As a conclusion, it appears appropriate to mention a few 
areas in which further developments would find immediate practical 
applications. These include: 

(1) The further study of relations between spectra and time-history 
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Fra. 9. Comparison of calculated and measured average 
number of peak values per mile of flight. 


characteristics: For the Gaussian case, a number of relations which are 
proving useful have already been derived for such quantities as the 
expected number per unit time of crossings of given values and the 
expected number of peaks per unit time exceeding given values. Other 
quantities are also of interest for which no simple relations yet exist. 
For example, the expected number of ranges of given intensity, where 
the ranges are the amplitudes of the oscillations as measured from a 
trough to the succeeding peak, is a quantity of particular current in- 
terest in regard to fatigue studies. In addition to the expected number 
for these quantities, information on the sampling variability of these 
quantities is also of interest. 

(2) The second area is that of spectra and cross-spectra estimation. 
The estimations of these quantities are formidable numerical tasks and 











1038 AMERICAN STATISTICAL ASSOCIATION JOURNAL, DECEMBER 1935 


the additional development of efficient estimators is needed. The vari- 
ability of some spectra estimators has been established by Tukey and 
others for the Gaussian case. However, comparable results have not as 
yet been obtained for the cross-spectra case. 

(3) Most of the discussion has been limited to linear systems and 
Gaussian processes. These two assumptions are frequently poor ap- 
proximations. Unfortunately, the areas of non-Gaussian processes and 
non-linear operations have, because of their difficulty, so far received 
very little attention. Further developments in these areas appear 


needed before many of the engineering problems can be treated ade- 
quately. 
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USE OF THE RANGE INSTEAD OF THE 
STANDARD DEVIATION 


Gortrrizp E. NoETHER 
Boston University 


Many standard test and estimation procedures require the 
computation of the sample standard deviation. It is, however, 
often possible to replace the standard deviation by the more 
easily computed sample range without appreciably reducing 
the precision of the method. 7 his paper discusses range meth- 
ods which have been suggested in the statistical literature in 
connection with problems about the means and variances of 
one or two normal populations. 


I. INTRODUCTION 


N A great many statistical problems it is necessary to know some- 
thing about the dispersion of the population in which we are inter- 
ested. The dispersion of a population is usually measured by its stand- 
ard deviation ¢. Thus the problem arises to estimate the unknown 
population standard deviation from the information contained in a 
random sample taken from the population. The standard procedure is 
to estimate o by the sample standard deviation. However, from a prac- 
tical point of view, the sample standard deviation has one rather serious 
disadvantage, the amount of labor and time required to compute it. 

Often all that is needed is a quick check in order to find out whether 
a given hypothesis is reasonable or not. We may not want to make a 
final decision at the moment, but only want to get a preliminary indi- 
cation of what to expect in the future. In such a case it is certainly de- 
sirable to have available methods which do not require lengthy com- 
putations involving the use of a calculator. 

As early as 1925, Tippett [7] showed how the range R, i.e., the dif- 
ference between the largest and the smallest observations in the sam- 
ple, could be used as an estimate of o. Obviously, the range is not sub- 
ject to the computational disadvantage of the standard deviation. 
There are not many other statistics of practical importance which are 
as easily computed as the range. 

Since Tippett’s early investigation an ever increasing number of 
papers dealing with the subject of substituting the range for the stand- 
ard deviation has been published in various statistical journals. On the 
other hand, the methods suggested in these papers do not seem to have 
found widespread application except in their simplest form in statis- 
tical quality control. A reason for this is possibly the fact that few 
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textbooks of statistics have dealt with this question beyond the state- 
ment that the range is a rather inefficient estimate of o if based on more 
than 10 or 15 observations. At the same time, the original contribu- 
tions to the problem have often been unknown to the average user of 
statistical methods. 

The present paper discusses certain important range methods which 
have been suggested in connection with problems concerning the 
means and variances of one or two normal distributions. The paper 
is divided into two parts, the first part explaining the procedures for 
estimating unknown parameters (primarily by means of confidence 
intervals) and for testing hypotheses concerning these parameters, 
the second part discussing more theoretical aspects. Thus, when apply- 
ing range methods as short cuts for standard methods, reference to 
Section II is sufficient. However the reader is advised to look also at 
Section III in order to gain a better understanding of the methods in- 
volved, in particular of the extent to which these methods are less pre- 
cise than standard methods. 

It will be seen that especially for problems involving means the 
loss in precision is surprisingly small and for many practical purposes 
is more than compensated for by the greater speed and simplicity of 
working with ranges. Thus the use of range methods, instead of the 


corresponding methods involving the computation of the sample stand- 
ard deviation, seems to be justified in many situations when it is not 
absolutely necessary that the last bit of information be extracted from 
the available data. 


II. APPLICATIONS 


2.1. Notation and General Remarks. The following notation is used in 
the formulas of Section II: 


pw =mean of underlying normal distribution 
o =standard deviation of underlying normal distribution 
N =sample size 
m =number of subsamples 
=gize of subsamples 


wae >> z;=mean 
MN int 
R;=range of the 7th subsample, 7=1, --- , m 
R =(R,+ ---+ R,)/m=mean range 
ay =constant given in Table 1 
Sr=an(Ri + +++ + Ry) =anz 
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R 
| f1 — 42 


R 


G, = 


Jia, +=1, 2, =critical value of G; at significance level a 
h, =factor for lower confidence limit of ¢ 
h. =factor for upper confidence limit of c. 


The methods discussed in this paper require that if the available 
sample contains 12 or more observations the original sample be sub- 
divided into several subsamples of equal size, omitting some observa- 
tions if necessary. The number and size of subsamples will be denoted 
by m and n, respectively. Let N be the total number of observations 
in the original sample. Table 1 shows how m and v should be chosen 
for N =2(1)100. Of course, if N is at the disposal of the statistician, it 
is best to choose it in such a way as to make the omission of any ob- 
servations in the subsequent analysis unnecessary. As will be seen in 
Section III, sample sizes which are multiples of 8 are especially suitable 
when using range methods, though sample sizes which are multiples 
of 7 or 9 are almost as good. 

Whatever the number and size of the subsamples, it is important 
that they be chosen in a random fashion. Thus if the observations are 
given in a random order we might simply take the first n observations 
as the first subsample, the next n observations as the second subsam- 
ple, etc. However, if the observations have been arranged in a system- 
atic fashion, we must select n observations at random from among all 
N observations to form the first subsample, then select another n ob- 
servations from among the remaining N-n observations to form the 
second subsample, etc. 

2.2. Problems Involving Means. Problems involving means can be 
solved with the least amount of computational work by making use of 
two tables computed by Jackson and Ross [3]. These tables give critical 
values, gia, Say, 7=1, 2, for significance levels a=.10, .05, and .01 of 
the two statistics 


g, - L2z#l and G, = 
R 


which can be used to test hypotheses of the type n=yo and w= pe, re- 
spectively. We shall describe now how these tables can be used to find 
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two-sided confidence intervals having confidence coefficient 1—a for 
the mean of a single normal distribution and the difference of the 
means of two normal distributions. One-sided confidence intervals 
having confidence coefficient 1—a/2 are obtained by using only one of 
the limits of the corresponding two-sided confidence intervals. 

A confidence interval for yu is given by 


E—giak <u <E+ giak. 


Here R is the mean range of the m subsamples of size n obtained from 
the original sample of size N; gia is the critical value taken from Table 
I (3]; 2 is the mean of the mn observations used in computing R. 

Ezample 1a: An 1.Q. test has been given to 25 persons with the fol- 
lowing results: 92, 90, 124, 110, 91, 118, 98, 107 | 128, 108, 112, 104, 
100, 103, 105, 97 | 105, 99, 92, 101, 99, 113, 111, 104 | 109. To find a 
confidence interval for the mean score having confidence coefficient .95. 

From Table 1 we find that for N =25 we should use 3 subsamples of 
size 8, which means that we have to omit 1 observation. Assuming that 
our observations are written down in random order, either as a conse- 
quence of the method by which they were obtained or by rearrange- 
ment with the help of random numbers as, e.g., The RAND Corpora- 
tion’s A Million Random Digits, we divide the total sample into sub- 
samples as indicated by the vertical bars. We easily find Ri =124—90 
=34, R2=128—97=31, Rs=113—92=21, so that R=(34+31+21) 
/3=28.7. Also, £=2511/24=104.6. Entering Table I [3] with m=3, 
n=8, a=.05, we find g:,.05=.15. Thus the confidence interval for yu is 
given by ‘3 


104.6 — .15 XK 28.7 < wu < 104.6 + .15 X 28.7 


100.3 < » < 108.9. 
The corresponding confidence interval based on Student’s ¢ is given by 
100.8 < » < 108.8. 


To test the hypothesis u = wo against the alternative uo we can use 
the above confidence interval, or, if we prefer, the statistic G, as indi- 
cated in [3]. 

One-sided tests at significance level a/2 are carried out by comparing 
the statistic 

=E— 4p 


Git = 
’ R 
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with +912 depending on whether the alternative is p> jo or p<. 

Example 1b: Using the data of Example 1a, to test the hypothesis 
that 1 = 108 against the alternative that u <108. 

104.6 — 108 
Gi) = a 12. 
28.7 

Since —.12<—.10, the .05-significance point for the one-sided test, 
we reject our hypothesis at the .05 level. 





The use of the statistic G. in comparing the means of two populations 
is complicated by the necessity of dividing both samples into sub- 
samples of equal size n. Once this has been achieved, a confidence inter- 
val for ui— 2 having confidence coefficient 1—a is given by 


(4: — 42) — Jeak < wi—me < (4%: — FZ) + Jrak. 


Here R is the mean range of the m,-+mz subsamples of size 7; go, is 
the critical value taken from Table II [3]; #, and # are the respective 
means of the observations used in computing R. 

If the original two samples are of equal size, Ni = N2=N, the values 
of n and m:=m2=m can be determined from Table 1. If Nix Ne, the 
division into subsamples suggested in Table 1 cannot be used unless 
we should have n; =n as, e.g., for Ni: =38, Ne=46 for which n=n.=9. 
If according to Table 1 n:+ne, as a general rule, it is best to choose as 
n that number between 6 and 10 which requires the smallest total 
number of observations to be omitted. However subsamples of size 5 
or 11 should also be considered if their use leads to a considerable re- 
duction in the number of observations to be omitted, as, e.g., for 
Ni=15, Ne=20 or Ni=22, N2=34. Of course, if at all possible, the 
simplest solution is to choose N; and N- in such a way that both are 
multiples of the same number n, with n preferably 8 or a number close 
to 8. 

Example 2a: Two kinds of rations fed to test animals for a given 
period of time produce the following weight gains: 


Ration 1: 

42, 38, 34, 33, 37, 35 | 35, 38, 33, 30, 36, 39 | 37 
Ration 2: 

33, 34, 33, 30, 36, 32 | 28, 32, 30, 36, 33, 35 | 31. 


To find a confidence interval for the difference in weight gain having 
confidence coefficient .99. 
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Since Ni=N2= 18, we find from Table 1 that each sample should be 
divided into 2 subsamples of size 6, omitting thus 1 observation from 
each sample. Using the subdivision as indicated by the vertical bars, 
we find R=(9+9+6+8)/4=8. Also, #:=430/12=35.8, %#.=392/12 
=32.7, #:—4#.=3.1. Thus the confidence interval becomes 


3.1 — 46 X8 <i — we < 3.1 + 46 X 8 


—.6 < mi — we < 6.8. 


Tests of hypotheses concerning the difference y:— je are carried out 
either by referring to the corresponding confidence interval or by using 
the statistic G, directly. 

Example 2b: For the data of Example 2a, 


G, = — = .39 
8 


which according to Table II [3] is significant at the .05 level, but not 
at the .01 level. The second statement follows already from the con- 
fidence interval given above. 

One-sided tests at significance level a/2 are carried out by comparing 


with + goa. 
Above methods for comparing the means of two normal populations 
can be used only if it is possible to assume that the two populations 


have equal variance. 
2.3. Problems Involving the Standard Deviation. An unbiased esti- 
mate of o is given by 


Sr = and 


where ay is given in Table 1 and 2=R,+R2+ --- +R», Ri, 7=1, 2, 
+++, m, being again the range of the 7th subsample. 
A confidence interval is easily found with the help of Sr: 


hiSre<o <hSe 


where h; and h, are the factors tabulated in Table 1 for confidence 
coefficients .90 and .98. 
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TABLE 1 


SUBSAMPLE DETERMINATION AND ESTIMATION 
CONSTANTS FOR ¢ 

















Coefficient Approximate factors for 
Sample Number __ Size for confidence limits of ¢ 
rtd of sub- of sub- unbiased 
N samples samples’ estimate Conf. coeff. 0.90 Conf. coeff. 0.98 
m n of « ieee, 
an hi he hi hy 
2 1 2 .886 41 12.7 31 63.7 
3 1 3 .591 5 3.9 41 8.9 
4 1 4 .486 .57 2.7 47 4.8 
5 1 5 .430 .60 2.3 51 3.5 
6 1 6 .395 — .63 2.0 .53 2.9 
Y | 1 7 .370 .65 1.88 .55 2.6 
8 1 8 .351 .66 1.78 .57 2.4 
9 1 9 .337 .68 1.71 .58 2.2 
10 1 10 .325 — .69 1.66 .60 2.1 
1l 1 1l 315+ .70 1.61 .61 2.0 
12-13 2 6 .197 71 1.57 .62 1.94 
14-15 2 7 .185— .72 1.50 .64 1.80 
16-17 2 8 .176 .74 1.45 .66 1.70 
18-19 2 i) .168 .75 1.43 .68 1.67 
20 2 10 .162 .76 1.41 .69 1.65 
21-23 °3 7 .123 .77 1.39 .70 1.62 
24-26 3 8 .117 .78 1.36 71 1.56 
27 3 9 112 .79 1.34 42 1.52 
28-29 4 7 .0924 .79 1.33 43 1.51 
30-31 3 10 .108 .80 1.31 .74 1.49 
32-34 4 8 .0878 .80 1.30 .74 1.47 
35 5 7 .0740 81 1.28 .75 1.44 
36-39 4 9 .0842 81 1.28 75 1.43 
40-41 5 8 .0702 .82 1.27 .76 1.41 
42-44 6 7 .0616 .83 1.26 77 1.39 
45-47 5 9 .0673 .83 1.25 77 1.37 
48 6 8 .0585 + .83 1.23 ‘ 1.35 
49 7 7 .0528 .83 1.23 .78 1.35 
50-53 5 10 .0650 .83 1.22 .78 1.34 
54-55 6 9 .0561 .84 1.22 .79 1.33 
56-59 7 8 .0502 .84 1.21 .79 1.32 
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TABLE 1—(continued) 








Coefficient Approximate factors for 

Number Size for confidence limits of ¢ 
of sub- of sub- unbiased 
samples samples estimate Conf. coeff.0.90 Conf. coeff. 0.98 
m n of o 
an hi he hi ha 











-0542 -85 .20 .80 31 
.0411 85 19 .80 .30 
.0439 85 19 .80 .30 
.0359 85 -19 .80 .30 


oan oe 


.0370 .86 19 81 .29 
-0390 .86 18 81 
.0336 .86 18 .82 
-0304 86 18 .82 


ee ee 
anon 


-0351 .87 aT 82 26 
.0374 .87 17 ‘ 26 
.0308 87 17 ‘ 26 
-0319 .88 .17 .83 


9 
12 
11 


ono 


-0337 .88 16 .83 .25 
.0284 .88 -16 .83 .24 
-0293 .88 16 .83 .23 
.0264 .88 15 .83 .23 
.0306 .88 15 .83 .23 


90 10 
13 
12 
98 14 
99 11 


9 
7 
8 
7 
9 


a" 
—) 


100 10 .0325 — .88 1.15 .83 1.22 





Example 3a: Suppose that for a set of 75 observations we have 
found > =295, = being based on 9 subsamples of size 8 omitting 3 ob- 
servations as indicated in Table 1. Then Se=.029X295=11.5. The 
confidence interval with confidence coefficient .98 becomes 


81 X 11.5 < ¢ < 1.28 XK 11.5 


9.3 <6 < 14.7. 


The two end points of the confidence interval given above can also 
be obtained graphically. The equations o=h)Sp and o=h,Sp represent 
two straight lines through the origin in the (Sp, o)-plane. Having ob- 
served a particular value Sp, we draw a line parallel to the c-axis at 
distance Sr. This line intersects the other two lines in the end points of 
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the confidence interval. This graphical method is certainly advisable 
if repeated confidence intervals based on samples of the same size 
are to be found. 

As usual, tests of hypotheses concerning o can be carried out by 
checking whether the hypothetical value of ¢ lies in the corresponding 
confidence interval or not. Alternatively, the percentage points of 
Sr/o tabulated by Grubbs and Weaver [2], pp. 233-4, can be used. 

Example 3b: Using the information of Example 3a, to test the hy- 
pothesis that o<10 against the alternative that ¢>10. Sr/oo=11.5/10 
= 1.15. From Grubbs and Weaver’s table for 9 subsamples of size 8 we 
find the (one-sided) .05 significance value as 1.16. Thus we do not have 
sufficient evidence to reject our hypothesis at the .05 level. 


Since in testing the hypothesis y,=ye2 it is necessary to make the 
assumption that o:=02, it would be desirable to have a quick range 
test for ascertaining whether this assumption is reasonable or not. Un- 
fortunately, tables for such a test exist only for the case when the num- 
ber of observations in each sample does not surpass 10. 

The test consists in forming the ratio of the two ranges putting the 
larger range in the numerator. The null hypothesis of equal variance 
is rejected if the observed value of the range ratio is too large. Critical 
values for the range ratio are given in [1], pp. 317-8, Table 8d. In using 
the table, nm, refers always to the number of observations on which 
the range in the numerator is based. As in the case of the corresponding 
F-test, the significance level of this test is twice the probability level 
indicated in the table. 


III. DISCUSSION 


We shall now discuss the theoretical basis and justification of the 
procedures given in Section II. 

3.1. Determination of Size of Subsamples. Let R= Rin) stand for the 
range of a sample of size n. It is easily shown that there exist constants 
d, depending on n but not on o (nor the mean u of the underlying nor- 
mal distribution) such that the expected value of RF is d,o, 


E(R) = dye. (1) 


thus d, is the expected value of the range of a random sample of size 
n from a normal distribution with variance 1. Numerical values of 
d, were computed by Tippett [7] for values of n between 2 and 1000. 

We can use (1) to find an unbiased estimate of o, namely R/d,. For 
small values of n, this estimate is only slightly less accurate than that 
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pased on the sample standard deviation s, where s =[Z(x—#)*/(n—1)}'” 
However as 7 increases beyond 12 or 15, say, the loss in accuracy soon 
becomes considerable. There is also another reason which will be dis- 
cussed at a later point why we should not use ranges based on samples 
of too large a size. This means that for such values of n our procedure 
has to be modified somewhat. Instead of computing the range from 
the entire sample, it is now necessary to divide the given sample into 
a number of subsamples and then compute a weighted mean of the 
various subsample ranges. 

Let then N denote the total number of observations available. 
For N =2(1)100, Grubbs and Weaver [2] have determined the particu- 
lar subdivision of the total sample (and corresponding weighting 
factors) which produce the unbiased estimate of o having minimum 
variance. Actually the method of Grubbs and Weaver can be further 
simplified. As an example, take the case when N =58. According to [2] 
we should divide the given sample into 5 subsamples of size 8 and 2 
subsamples of size 9. The sum of the ranges of the 5 subsamples of size 
8 is multiplied by .0485. To this is added the sum of the 2 ranges of the 
subsamples of size 9 multiplied by .0521. The standard deviation of this 
estimate is .1070c. We can save one of the two multiplications by omit- 
ting 2 of the observations and dividing the remaining 56 observations 
into 7 subsamples of size 8. The estimate of ¢ based on this method has 
standard deviation .1088¢, which is not very much larger than that of 
the earlier estimate. This latter method certainly has the advantage of 
greater simplicity and symmetry without unduly increasing the stand- 
ard deviation of the resulting estimate. In general, by omitting some of 
the observations if necessary, it is always possible to divide the re- 
maining observations into subsamples of equal size n. 

Given a sample of size N, the question arises how n should be chosen. 
A good procedure is to choose for n that value between 6 and 10 which 
requires the fewest number of observations to be omitted. If for a given 
N this can be done in more than one way, subsamples of size 8 are pref- 
erable to those of size 7 or 9, which in turn are preferable to subsam- 
ples of size 10 or 6. This rule has been used in the construction of Table 
1 for N =2(1)100. 

The division into equal subsamples given in Table 1 is not always 
the one leading to the smallest possible standard deviation of a range 
estimate based on equal subsamples. Thus for N = 26, a division into 
2 groups of size 13 would result in an estimate having standard devia- 
tion .1630 instead of .166¢ for the estimate based on 3 groups of size 8, 
omitting 2 observations as suggested by Table 1. There are a few addi- 
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tional cases. The difference in standard deviation is always negligible. 
The reason why the division into subsamples given in Table 1 is prefer. 
able to others giving possibly a smaller standard deviation is that, ex. 
cept for N=15, such other subdivisions correspond to subsample 
sizes n>10. Now throughout this paper it is always assumed that the 
underlying distribution is strictly normal. However, in practice, slight 
deviations from normality, especially in the tails of the distribution, 
usually occur. Such deviations from normality are apt to influence 
ranges based on a large number of observations much more than those 
based on a small number of observations. It is, therefore, advisable to 
keep the subsample size relatively small. 

3.2. Test and Estimation Procedures for «. Given a sample of size N, 
it is divided into m subsamples of size n as indicated in the previous 
section omitting some of the observations if necessary. Let Ri, -- -, 
R,, denote the ranges of these m subsamples. By (1), an unbiased esti- 
mate of ¢ is given by 


1 
—<— (Rit -:+:+Rn).- (2) 


If we set 1/md,=ay, we get the unbiased estimate of o given in Sec- 
tion 2.3. 
It follows immediately from (2) that the variance of Sz is 


var (Sr) = o7a,7/m (3) 


where a,” is the variance of Ri»)/dn, Rin) being this time the range of a 
random sample of size n from a normal distribution with variance 1. 
For the convenience of the reader, Table 2 below gives the values of 
dn, 1/dn, o,”, and o, for n=6(1)10. These values can be used, e.g., when 
dealing with the case N > 100. 
As already mentioned in Section 2.3, Grubbs and Weaver [2], pp. 


TABLE 2 
RANGE CONSTANTS 








1/d, 





395 
.370 
.351 
.337 
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993-4, have tabulated approximate percentage points of the distribu- 
tion of Sr/o (Grubbs and Weaver use the notation R,/d, for Sp) 
which are surpassed with probability .995, .990, .950, .050, .010, .005 
for m=1(1)15 and n=6(1)10. The values h, and h, in Table 1 are sim- 
ply the reciprocals (after some necessary smoothing) of these percent- 
age points. The tables by Grubbs and Weaver also give the values of 
on'/M. 

on accurate are the range procedures discussed in 2.3 in compari- 
son with the standard methods based on the sample standard deviation 
3? We can compare the two estimates of ¢ by computing their relative 
efficiency é, i.e., the ratio of their variances. We can interpret 1—e as 
the proportion of observations which are “wasted” by using the less 
efficient estimate, i.e., the estimate having the larger variance. In 
our case, the quantity e can also be used in comparing the two test 
procedures based on Sz and s, respectively. 

If N is sufficiently large, we have to a good approximation var (s) 
=¢?/2(N—1). Thus by (3) 


1 
2(mn + 1 — 1) 7 1 








on” l-—1 
2no,? (1 - ) 





m mn 


where | is the number of observations omitted in computing Sp. Since 
for most values of N, lis either 0, 1, or 2, we get a good idea of the effi- 
ciency of the estimate Sz by computing 

, 


1 
e = . 
2no,” 


We get the following table: 


6 7 10 
744 752 ‘ 751 745 


This table bears out our earlier assertion that when using range meth- 
ods subsamples of size 8 are best with subsamples of size 7 and 9 being 
almost as good. It further shows that the relative efficiency of Sr is 
approximately .75 at least for sufficiently large samples. From a, theo- 
retical point of view, this means that we are “wasting” one out of 
every four observations by using Sp instead of s. But let us look at the 
question from a somewhat different point of view. 
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We shall assume first that we are interested in the estimation prob. 
lem. It seems to this writer that from a practical point of view, instead 
of looking at the efficiency e, we get a better idea of the loss in accuracy 
by comparing the average length Lr of the confidence interval for , 
based on Sz with the average length L, of the confidence interval based 
on s. From the formula for the confidence interval given in Section 23 
we easily find 

Lr = Sr(hu — i), 
and similarly, 
L, = s(hu’ — hi’) 


where h,,’ and h,’ are the corresponding factors based on the s/o dis- 
tribution. Thus E(Lr)=oc(hu—h;) and H(L,) =co(hy’—h:z’) where ¢ 
is defined by E(s) =co. 

The following is a table of approximate values of E(Lr)/E(L,) for 
various values of N and confidence coefficient .90: 





N 25 50 75 100 
E(L 

(Le) 1.17 1.16 1.15 1.15 
E(L,) 


Thus the average length of the confidence interval based on Sz is 
about 15-17 per cent longer than that based on s. 

Let us now take the problem of testing a hypothesis concerning zg, 
say, ¢=o against the alternative ¢=01><¢o. In considering a test of 
this hypothesis we should certainly ask the following question. How 
much larger than oo must o; be so that we can be reasonably certain to 
reject our hypothesis when the alternative is true? In other words, if 
we set o1= ao, for what value of \ do we have probability 1—8, where 
8 is some small number, of rejecting the hypothesis tested. It is well- 
known that A =Xal X,8 where Xp? is the (1—P)-percentile of the Chi- 
square distribution with N —1 degrees of freedom. Similarly, it can be 
shown that dz is the ratio of the corresponding percentiles of the Sr/c 
distribution. For a=8=.05, we find the following table of approximate 
values of Ag/)s: 


N 25 50 75 100 
AR 
7 1.08 1.05 1.04 1.04 





1In the customary quality control notation, ¢=[N/(N—1)]/%1 where c:=(2/N)¥2r(N/2) 
/T(N/2—1/2). 


USE 








or 
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indicating that o; has to be about 4-8 per cent larger when using Sz as 
the test statistic than when using s. 

There is one other point which should be mentioned. It follows from 
an investigation by King [4] that the loss in efficiency discussed earlier 
js primarily due to the fact that our range methods require the sub- 
division of the original sample into m subsamples. This produces a loss 
of m—1 degrees of freedom, a loss which cannot be made up without 
substantially complicating the whole procedure. Actually, if the origi- 
nal data consist of m sets of n observations each (n reasonably small, 
say, <12), the efficiency of the range estimate Se compared to the 
standard pooled estimate of ¢ is almost 1. 

3.3. Problems Concerning the Mean. The standard statistic for testing 
hypotheses concerning the mean of a normal distribution is Student’s ¢, 


./N(z — h) 


s 


{ = 


It seems then reasonable to investigate the distribution of a statistic of 
the form 
_ Vmn(é — u) 

Sr 





u 


This has been done by Lord [5]. Actually, the statistic 


le—yl 
,_3 ee 


R 


which is closely related to u requires fewer computations and is there- 
fore preferable from a practical point of view. Critical values for this 
statistic have been computed by Jackson and Ross [3] from those given 
by Lord. If we denote these critical values at level a by gia, the confi- 
dence interval for u given in 2.2 follows immediately. 

Again we can get an idea of the loss in accuracy due to the use of G; 
instead of ¢ by comparing the average lengths of corresponding confi- 
dence intervals or computing a quantity similar to X. 

From 2.2, the length of the confidence interval for u based on G; is 
Lqe=2giak. Similarly, the length of the confidence interval based on 
tis Ly=2t.s/+/N, where t, is the corresponding critical value for the 
t-distribution. Thus E(L¢)=2giadno and E(L;) =2taco/+/N where, as 
before, c is a constant such that E(s) =cc. 
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For confidence coefficient .95, we get the following table: 





N 25 50 75 100 
E(Le) 1.03 1.00-+ 1.02 1.00+ 
ae 


Thus in the two cases (N = 25, 75) when it is necessary to omit some of 
the original observations, E(Lg) is between 2 and 3 per cent larger 
than E(L;), while in the other two cases the difference in length is 
negligible for all practical purposes. 

From the small difference in the lengths of the two confidence inter- 
vals we may already suspect that the loss in accuracy when using the 
G,-test instead of the ¢-test is also small. This is borne out by a study 
by Lord [6]. Thus for a=.05 and .01, Lord has computed quantities 
p such that when using two-sided critical regions of size a the hy- 
pothesis 4 =o is rejected with probability 1—a if actually up =p0+ po;. 
If we therefore use pg and p; for the G,- and t-tests, respectively,’ 

Pa/ JV mn Pa N 


po) /N pV mn 


measures by how much more the true value u must differ from the hy- 
pothetical value uo in order to be recognized by the G,-test with the 
same probability 1 —a as by the /-test. For a=.05 and the usual sample 
sizes, this quantity turns out to be 1.03, 1.00+, 1.02, 1.00+, respec- 
tively. 

Similar results hold also in the two-sample case. We may then con- 
clude that for many practical purposes the loss in accuracy due to 
using G-statistics instead of the standard ¢-statistics is unimportant. 
This is particularly so if the sample sizes are chosen in such a way that 
the computation of the mean range R does not require the omission of 
any observations. Thus we are certainly justified in using the methods 
discussed in this section quite generally. 





2 See note 3 in this connection. 

3 In his original paper, Lord suggests that, when computing the statistic u, all observations be used 
in finding # in the numerator, even though it may be necessary to omit some of the observations in 
computing Sg in the denominator. If this is done, the loss in accuracy of the u-test as compared to the 
t-test is given by the expression pg/p:, which for a =.05 and the usual sample sizes teke the following 
values: 1.01, 1.00+, 1.00+, 1.00 +. Thus any loss in accuracy is not so much due to ‘ie use of the mean 
range instead of the standard deviation, but due to the omission of some of the observations in the 
computation of 2. This loss in accuracy can be avoided if the original sample size is chosen in such a way 
that the subsequent analysis of the data by means of the G.-test does not require the omission of any 
observations. 

These remarks apply also to confidence intervals. In addition, if no observations are omitted, the 


confidence intervals based on G: and ¢ are centered at the same value Z, and therefore differ only in 
length. 





USE 
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A DIAGRAMMATIC REPRESENTATION OF THE SUM of 
SQUARES AND PRODUCTS 


Cc. C. Li 

University of Pittsburgh 

This paper gives a geometric illustration of some of the 
commonest identities in the algebra of the sum of squares, 
The method is essentially the construction of squares of the 
various numbers and their means. These pictures provide ys 
with a visual representation of what the identity says. The 
method is later extended to represent the identities concerning 
the sum of products by constructing rectangles. 


I. INTRODUCTION 


wo of the commonest identities in algebra of the sum of squares 
are 


(Le net 2), 


(1) 


kon kom; k 

Vd wig - = VUE. 4s)? + Vins -H. 2) 

i=l a=] i=1 a=l i=] 
The first identity refers to a single group of N numbers, 2a, a=1, 
2,---, N, and = >> 2/N, the arithmetic mean of the N numbers. 
The second refers to the case of N numbers subdivided into k groups 
(t=1, 2, - - - , k) with n; numbers in the 7-th group, and m+m+ - -- 
+n.=N. Both may be demonstrated, in one way or another, by 
straightforward algebra. For non-mathematical beginners, however, a 
visual demonstration is almost always preferred. In his teaching, the 
writer finds it extremely useful to illustrate the relations (1) and (2) by 
geometric construction. The usual algebraic derivation is also given in 
the class. The following diagrammatic representations are not meant 
to be formal “proofs” but are simply pictures to illustrate what the 
identity says. 


II. REPRESENTATION OF IDENTITY (1) 


With the given N numbers, (i) draw a base line of length T=2,+22 
+ --- ay, (ii) cut the base line into N segments, the first segment of 
length xz, second of length 22, etc. and (iii) construct a square for each 
single segment. Then the total area of this series of squaresis )\%.7= A, 
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which is represented in the top row of Figure 1; in our example the num- 
pers are Ta = 6, 2, 9, 3, with 7T’=20, cut into 4 segments. 

Next draw another base line of the same length 7’, but this time cut 
it into N segments of equal length. The segment length is then #; in 
our example, #=20/4=5. As before, construct a square for each seg- 
ment. The total area of this series of squares is #7+2#?+ ---#=Nz 
=(, represented by the middle row of Figure 1. At this point it should 
be said that the area A, comprising N squares of unequal size, is always 
larger than the area C, comprising N squares of the same size on the 
same base line of length 7’. In other words, for any fixed length 7 and 
fixed number (NV) of segments to be cut, the area NZ is a minimum. 
As a matter of fact, the larger the differences between the segments, 
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the larger will be the area A. If the student is not convinced, suffice 
it to point out that in cutting the length 10 into two segments 
92+ 12>8?+42?>72+3?>6?+4? >52+5?. 

Finally, as illustrated in the bottom row of Figure 1, take the differ. 
ence 2.—# for each pair of corresponding segments and construct a 
square for each of them. The total area of N such squares jg 
>> (ta—#)?=D. In the example, the area is equal to 


D = (6-5)? + (2-5)? + (9-5) + (3-5)? 
=1+9+4+ 16+ 4 = 30. 


The area D is exactly equal to the difference between A and C. Thus, 
the identity (1) may be written in terms of the three areas: 
A-C=D; (1) 


in our example, 130—100=30. Once this idea is clear, the last series 
of squares (area D) may be omitted from the diagram and simply be 
referred to as the difference A —C. This will be done in the next section, 




























III. REPRESENTATION OF IDENTITY (2) 


The chief value of the method described above lies in the ease and 
clarity with which the identity (2) may be demonstrated. We shall 
take as our example the ten (VN = 10) numbers 6, 1, 2, 8, 2, 7, 6, 2, 1, 5, 
and shall further suppose that they are divided into three (k =3) groups. 
The first group with n,=3 being 6, 1, 2, the second group with n.=5 
being 8, 2, 7, 6, 2, and the third group with ns=2 being 1, 5, 80 
that n:+n2+n3= N = 10. Construct the areas A and C as before, regard- 
less of the grouping; the length of the base line is T= >> > ri, =40 
as shown in Figure 2. The areas are: 


A=) Dd 21,2 = 224 
C = Nz? = 160 
A-C=)D Dd (ui, - #? = 64 


The last expression is the left-hand member of identity (2) and is the 
total sum of squares of deviations from the general mean. This value, 
of course, has nothing whatsoever to do with the grouping of the num- 
bers. 

Now, in between A and C, draw another base line of the same length 
T. Cut the base line into k “major” segments, so that the first major 
segment length is the total of the first group of numbers; that is, 
Ti= Doe M1,=6+1+2=9, and T;+72:+ ---+7,=T. Then cut each 
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major segment 7’; into n; equal parts; the length of these minor seg- 
ments is #;, the mean value of the ith group. Construct a square on 
each of the minor segments. The total area of these series of squares is 


B = maki? + nef? + --- +n,.i,? = 170 (3) 


as shown in the middle row of Figure 2. By the argument advanced in 
Section II, we know not only that A is larger than B, but also know by 
how much A exceeds B. Comparing group by group, the first m 
squares in A have an area larger than 7,4,” in B, and this is true for every 
group. Hence, 


A-B= (Xa - ma) + ++ (Line - ma) 


= > (nm, —&)? +--+) (te, — Zx)?. 


which is the first term of the right side of (2). 
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Similarly, we see that B is larger than C. Using the principle em. 
bodied in identity (1) once more, we obtain 


B-—C=ni?+-:-+> + md? — NZ = Di ni(%: — %)*. 


In brief, any sort of grouping will yield an area intermediate between 
A and C, because the series of squares in B is more even in size than that 
in A, but not as even as that in C. 

On re-examining the identity (2), we see that it says nothing more 
than the obvious truth that 


A-C=(A—B)+(B-(OC). (2’) 


In practical calculation all we need to compute are the three quanti- 
ties, A, B, C. 


IV. FURTHER RELATIONS 


Using the general procedure of constructing squares, a number of 
other simple algebraic relations may be illustrated. For example, if a 
single big square is constrctued on the base line T of part C in Figure 1, 
we obtain a diagrammatic representation of the relation 


C = Na = T?/N. 


Similarly, if squares are constructed for each major segment in B of 
Figure 2, we see that 

T;? T,.? 

Bid dee Hoi Cie (3’) 

m1 Nk 
The procedure of Figure 2 may be extended to the case of subgroupings 
within groups. The more complicated identities for the general “box- 
within-a-box” type of data may thus be written down immediately 
without going through the algebra. 


Vv. SUM OF PRODUCTS 


The diagrammatic method for the sum of squares may be readily 
modified to represent the corresponding identities concerning the sum 
of products of two series of numbers: 


Lis — Nay = 2) (x - ay — 5) (4) 
DL (tie — Ayia — 9) = DD ia — ¥:)(Yia — 51) 


‘ 


5 
+ Lo nil: — £)(5: — 9). - 
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e em. Detailed explanation seems unnecessary. In the example we consider 
the following four (N =4) pairs of numbers: 
x = 6, 2, 9, 3; T, = 20, 


Ween y = 4, 1, 5, 2; Ty = 12. 


that The area of the series of rectangles x-y is (Fig. 3, top) 
A =24+2+445+4+6 = 77. 


More 


(2') 


anti- 


T of 
if a 
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From the bottom diagram of Fig. 3 we see that the area of the N ree. 
tangles, each of size £-7, is 
Sate 


C = Nj = = 60. 
N 


Hence, 
~ («@ - ay - 5) =A-CH= 17. (4') 


Next, let us consider (5). Similar to the sum of squares, the sum of 
products of deviations, A —C, may be subdivided into two components 
—one for “within groups” and one for “between groups.” For simplicity 
we divide the four pairs of (xz, y) in our example into two groups, the 
first group comprising the first two pairs of (z, y), and the second group 
the last two pairs (n.+n2=2+2=4). Using the values of #; and j;, 
the group means of z and y, we construct the series of rectangles as 
shown in the middle diagram of Fig. 3, whose area is 


B= mij + NeF2Ve 


5 7 
2(4x =) +2(6 x =) = 20+ 42 = 62, 


Again, the identity (5) simply says: 
A-—-C=(A—B)+(B-(C). 
In our example, 
17 = 15+ 2. 


One important difference between the sum of squares and the sum of 
products is that the former is always a positive quantity while the 
latter can be negative. The case of negative sum of products may also 
be visualized easily by the aid of diagrams. For instance, if the four 
pairs of numbers are: 


x = 6, 2, 9, 3; 
y = 2, 5, 1, 4; 
the area of the four rectangles will be (Fig. 4) 
A=12+10+9+ 12 = 43, 
which is smaller than C = N#j=60. In this case, 
Dd (x — )(y — 9) =A —C = 43 — 60 = — 17. 
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Any component of (5) may be positive or negative, depending on the 
paired values of z, y. 





THE ANALYSIS OF VARIANCE: A GRAPHICAL 
REPRESENTATION OF A STATISTICAL 
CONCEPT* 
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AND 
Eric Reiss 
Washington University 


The analysis of variance is presented in graphical form to 
emphasize the conventional partitioning of the sums of 
squares. Areas are defined representing squares of non-uni- 
form series and sums of squares of uniform series, and the 
difference is displayed as the variation of the non-uniform 
series. The graphical presentation is extended to intra- and 
intergroup comparisons and to the F test. The two-dimen- 
sional analogy is presented as an aid to the teaching of sta- 
tistics supplementing the algebraic formulations. 


I, INTRODUCTION 


T:. analysis of variance has been discussed in various monographs 


and texts on statistical analysis [1, 2, 3, 4, 5]. The mathematical 
formulations in an introductory presentation not uncommonly appear 
as somewhat of an enigma to the uninitiated. This paper illustrates a 
a graphical representation of a statistical concept, and the authors be- 
lieve that it is a useful pedagogical technique. In no sense is it proposed 
that a graphical presentation should replace the mathematical argu- 
ments. However, a visual, two-dimensional analogy of certain con- 
cepts basic to the understanding of the analysis of variance may sup- 
plement the mathematical symbolism for those to whom the mathe- 
matical expressions lack a little warmth on first acquaintance. 

It is well known that if a sum is divided into a given number of parts, 
the sum of the squares of these parts will be a minimum when all the 
parts are equal. From this proposition a method of measuring the vari- 
ability, or lack of uniformity, in any series at once suggests itself to the 
student, since the difference between the sum of the squares of a non- 
uniform series and an appropriate uniform series might prove to be a 





* The concepts presented are an extension and elaboration of material reviewed by the authors at a 
seminar held in July, 1952, by the Surgical Research Unit, Brooke Army Medical Center, Fort Sam 
Houston, Texas, under the command of Colonel William H. Ampspacher, M.D. 

The authors gratefully acknowledge the helpful advice and criticism by Frederick Mosteller in the 
preparation of this paper. 
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sensitive index of the lack of uniformity. This difference of the sums 
of squares is, of course, a standard measure in the analysis of variance 
es the following well-known relationships indicate: 


D(X — X)? = xX? - = = > x?— NY. 


In this paper the term variation is reserved for quantities representing 
the difference of the sums of squares of non-uniform and uniform series. 

This orientation to the comparisons inherent in the analysis of vari- 
ance is not always emphasized, and it must be clearly understood to fol- 
low the subsequent arguments. From one source [1] we present with 
minor modifications the formulas in the analysis together with the 
definitions of the symbols used in this paper: 


Xi, X2,--- X, and Yi, Y2,--~- Y, are specific terms in Group X 
Data, Group Y Data, etc.; 

m represents the number of groups of data; 

V represents any term in any group; 

V represents the mean of all terms; 

Vx represents the mean of terms in the K data; 

N represents the total number of terms; 

Nx represents the number of terms in the K data; 

> represents a summation over the entire series; 


>, indicates a summation over the m groups of data; 
Nx 
>, indicates a summation over the K data. 


1 
If each group has the same number of terms, then variation within 


groups is defined as: 
m NK 2 
arts ¥( 27) 
>| Law - ve] = Dv- 


Nx 
«- Pe~ Saws. 


1 





Also variation between groups is defined as: 
m NK 2 
" 
2(> (vy 
Nr N 
> (WeVe) — NV" 


1 





. [Nx(Vx - V)?] as 
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if there are the same number of terms in the different groups. The 
fact that differences of squared quantities are equated with certain 
statistics used in the analysis of variance is clear if the formulas are 
expressed as above. 


II. GRAPHICAL PRESENTATION 


In Fig. 1 are represented Group X Data consisting of a series of 4 
terms with the values as shown. These values are represented as suc- 
cessive distances along the edge of a square, and from the geometrical 
construction the squares of these values are indicated by the stippled 
areas running diagonally across the large square. The cross-hatched 
area represents the sum of the squares of a hypothetical series of equal 


GROUP X DATA 












































10 
=— = 2. 
4 5 


v(x - X= ox 47 
1 1 


= 1.5? + 3.0? + 1.0? + 4.5° — 4 X 2.5? 
= 32.5 — 25.0 = 7.5 


Fia. 1 
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numbers containing the same number of terms and having the same 
mean. In the expressions for the computation of variation below the 
figure, the first term in the difference is represented by the sum of the 
stippled areas and the second term is represented by the cross-hatched 
area, i.e., 4X°. The difference of these two areas is numerically equiva- 
lent to the variation of the Group X Data. The arithmetical computa- 
tion in Fig. 1 reveals this to be 7.5. 

In Fig. 2 are represented Group Y Data consisting of a series of 4 
terms with the values as shown. This series is peculiar in consisting of 
4 identical values. By direct analogy with the discussion above, the 
sum of the squares of this series is equal to the area represented in the 
second term of the difference in the expressions below the figure. Since 
the difference in area is 0, the variation is likewise nil. 


GROUP Y DATA 
Ys 


¥ =—— =— =25 


4 


Ly - Fy = oy: — 47" 


= 4X 2.5° — 4 X 2.5? 


Fia. 2 
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In Fig. 3 are represented Group Z Data consisting of a series of 4 
terms of greater disparity. The variation here is 27.5. 

In summary the considerations thus far have attempted to elucidate 
by graphical methods the implications of the difference between cer. 
tain squared quantities in the analysis of variance. In each instance it 
is the difference between the sum of the squares of a particular series 
and the sum of the squares of a uniform series with an equal number of 
terms and sum and mean. The differences in the areas found have been 
displayed as the variation in a statistical analysis. The values of 7.5, 0, 
and 27.5 are each an index of the variation within a series. The com- 
parison of the Group X, Y, and Z Data with each other presents g 
more complicated problem and a more complicated geometrical 


GROUP Z DATA 
Zs 
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analogy. To determine the variation between these three series, in dis- 
tinction to the variation within any given series, areas must be defined 
representative of the sum of the squares of the terms of each series had 
each series been composed of uniform values, and the sum of these 
squared values must be compared to an area based on the sum of the 
squares of a hypothetical series of 12 equal terms with a sum and mean 
equivalent to the grand sum and grand mean of the combined three 
series. 

In Fig. 4 are represented the combined data of the previous three 


COMBINED GROUP DATA I 


Xi = 1.5 
X3 =3.0 
X;=1.0 
X,=4.5 
110--75 =35 
Y; =2.5 
Y;=2.5 


Y,;=2.5 
Y,=2.5 
Z,=1.0 


Z,20.5 7 
Z:=1.5 7 


LZ =7.0 





.-. rr = 
V = —— =— = 2.5 


N 12 


es — 3 
> vt —~> (Never) 
1 
=110-—-3X4 X 2.6 

= 110 — 75 = 35 


Within groups: 3 [ ¥ (V - 7x) | 


m _ i. - _ —8 
Between groups: 7 [x K(Vx — ¥] } * (NxVx) — NV 
1 1 


=3 xX 4 X 2.5% — 12 X 2.5 
= 75 -—75 =0 


Fie. 4 
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figures. In addition there are two squares in the right upper corner: 
the larger one designated by the dash-dotted line AA and a smaller one 
designated by the solid line BCBC. The square area between the line 
AA and the corner represents that obtained by adding the squares of 
tne 12 individual terms. The square area defined by the line BCBC rep. 
resents the area obtained by adding the squares of 12 equal terms with 
a sum equal to 30. This area is also equal to one obtained from the sum 
of the squares of the individual terms had each series within itself been 
composed of uniform values. In other words, the area AA is equal to the 
sum of the stippled areas, and the area BCBC is equal to the sum of the 
cross-hatched areas within each individual series and is also equal to the 
sum of the squares of 12 equal terms totalling the grand sum of 30. 

Considering the formulas in Fig. 4 by inspection the within group 
(intra-group) variation is numerically equal to the area AA BCBC, an L- 
shaped area with a value of 35. The between group (inter-group) vari- 
ation is numerically equal to the difference between the two equal 
areas represented by the square BCBC. Thus the between group vari- 
ation is 0, and no variation exists between the three series. This is 
indeed confirmed by our common sense evaluation since there were 
four determinations in each of the three series with a common sum of 
10 and a common mean of 2.5 in each series. The within group variation 
of 35 is equal to the sum of the variations of the three separate groups 
of data as previously displayed in Figs. 1-3: 7.5+0+27.5=35. 

The lack of between group variation in the above three series 
characterizes them as a special case, and a more generalized set of 
three series consisting of four terms each but with differing sums and 
means is shown in Fig. 5. The square area between the line AA and the 
corner is equal to the sum of the squares of the 12 individual terms in 
the three series and is equal to the sum of the stippled areas. The area 
by which this square exceeds that defined by BB, namely 23.5, is a 
measure of the variation within groups. The square defined by BB is 
equal to the sum of the three cross-hatched areas in each series, and the 
area by which this square exceeds that defined by CC, namely 12.5, is 
a measure of the variation between groups. The smallest square defined 
by CC is obtained by adding the squares of 12 equal terms with a sum 
equal to 30. The formulas and values for substitution are indicated in 
the figure. 

In Fig. 5 the left lower corner contains two additional squares that 
represent the effects of the correction for the degrees of freedom upon 
which the variation within and between groups is based. The larger 
square defined by HE derived from the L-shaped area BBCC of 12.5 
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COMBINED GROUP DATA II 


Rie 


87.5—75=12.5 
111—87.5=23.5 





m NK iy = —s8 
Within groups >> [ > (V — Vey | = DV? -d (NxVr) 
1 1 1 


= 111 — 4(2.5* + 1.25? + 3.75%) 
= 111 — 87.5 = 23.5 
m sas ~" = —3 —3 
Between groups: ), [ Nx(Vx - 7) = >) (NkVxk) — NV 
1 1 
= 4(2.5% + 1.25? + 3.75%) — 12 X 2.5? 
= 87.5 — 75 = 12.5 
P test: Square EE - 12.5 + 2 - 6.25 
SquareDD 23.5+9 2.61 


Fia. 5 


= 2.39; P>0.05 





is corrected for the 2 degrees of freedom possible in the comparison of 
the three series and has a value of 6.25. The smaller square defined 
by DD derived from the L-shaped area AABB of 23.5 and corrected 
for the 9 degrees of freedom, on which the within group comparison 
was based, has a value of 2.61. These areas defined by EE and DD 
represent the variance of the between group and within group compari- 
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sons respectively. The F test (2, 5) is the comparison of these two val- 
ues, and the variance ratio with the corresponding P value is shown. 


III. DISCUSSION 


Since instruction in statistics is assuming a more prominent position 
in the curriculum of universities, pedagogical techniques must be 
developed to aid the student in grasping the concepts of statistical 
theory. Visual representations are helpful, and there is merit in stregs- 
ing an interpretation of the analysis of variance which emphasizes 
that the sum of the squares of the parts of a whole will approach 
minimum as the parts achieve increasing uniformity. From this basic 
proposition, a graphical representation of certain statistical concepts 
inherent in the analysis of variance may be developed. These tech- 
niques in no way replace the mathematical derivation of the formulas, 
but they may serve as a supplement. 
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NOMOGRAPH FOR COMPUTING MULTIPLE 
CORRELATION COEFFICIENTS 


Freperic M. Lorp 
Educational Testing Service 


HE accompanying nomograph! may be used for calculating a multi- 
Tre correlation coefficient (R;.23) from the zero-order correlations 
(r;2, Tis, aNd 723). 

USE OF THE CHART 


If either riz Or 713 is negative, some sign changes are necessary before 
the nomograph can be entered. If only one of these two coefficients is 
negative, it must be made positive and the sign of res reversed. (This 
change is possible without affecting R.23 because it is equivalent to a 
simple reflection of the corresponding predictor variable.) If both riz 
and r;3 are negative, both must be treated as positive, the sign of rg 
remaining unchanged. 

In theory, it makes no difference which predictor variable is called 
variable 2 and which is called variable 3. In entering the nomograph, 
however, the user should always take as variable 2 the variable that 
has the numerically higher correlation with variable 1, so that 
| ras| =| rial. 

The network in the left half of the nomograph consists of a curvi- 
linear coordinate system, similar to the ordinary rectangular Cartesian 
coordinate system. One variable, riz, is represented by approximately 
horizontal curves labeled at their left extremity; the other variable, ris, 
is represented by approximately vertical or by oblique curves, labeled 
at the top and intersecting the curves for riz. The “curve” representing 
r13=0, it may be noted, is the vertical straight line at the extreme left. 

In general, all scales on the nomograph are marked off in intervals of 
.02. In certain areas where the curves would otherwise be too close to- 
gether, however, alternate curves have been omitted, so that the curves 
appear at intervals of .04. Linear interpolation between the curves is 
all that is required. (Since |r| is chosen greater than |ri3|, the user 
will not ordinarily wish to use the section of the graph where | r2| <.12.) 

The first step in using the nomograph is to find the point in the co- 
ordinate system at the left represented by (ri, ris). This is done by 
finding the intersection of the horizontal curve for ry with the vertical 
or oblique curve for 73. If r1:=713, the required point is to be found 
along the oblique straight line so labeled. 


1 Enlarged copies of the nomograph covered with laminated plastic may be obtained without charge 
by writing to Research Division, Educational Testing Service, 20 Nassau Street, Princeton, New Jersey. 
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The next step is to find the point representing the value of 7; on 
the vertical scale at the extreme right. A straight-edge or ruler is then 
placed on the nomograph so as to pass through both this point and the 
previous one. It will be found important to use the lower edge of the 
ruler, rather than the upper, in order to avoid covering up portions of 
the nomograph necessary for the following step. 

The network in the right half of the nomograph, like that in the 
left, represents a system of curvilinear coordinates. One variable, r.;, 
is represented by the approximately horizontal curves (labeled at the 
left and the right) ; the other variable, R1.23, is represented by the verti- 
cal straight lines (labeled at top and bottom). The final step in the 
procedure is to start with the point just found on the vertical scale 
at the extreme right and follow the r23-curve, passing leftwards along 
it until this curve again intersects the straight-edge. The R,.23-value 
of this intersection is the required value of the multiple-correlation 
coefficient. 

The user will wish to cover the nomograph with some kind of water- 
proof plastic material on which the first point (riz, 713) can be marked 
with a dot of ink. Some method of marking this point temporarily is 
a virtual necessity. The ink can be readily removed from the plastic 
with a damp cloth (or finger). 

The reader may wish to try out the procedure just described with 
the following examples: 

Ti2 re '23 Ri.23 
.52 .22 .40 .520 
.32 .58 .04 .652 
.48 — .22 .18 .572 
-20 .02 -60 . 236 


It will be observed in working out the last example that the value of 
R;.23 cannot be determined with great accuracy from the chart because 
the straight-edge and the curve for re; are nearly parallel at their point 
of intersection. This type of difficulty will be apparent to the user 
whenever it occurs, which is only for low values of R.23; it is for this 
reason that no lines are given for R,.2;< .20. The foregoing illustrates the 
fact that no general statement can be made regarding the degree of 
accuracy in the value of R,.2; obtained from the chart. 


ON THE CONSTRUCTION OF NOMOGRAPHS 


The construction of nomographs can be a challenging although time- 
consuming pastime, especially if methods involving the formulation of 
the nomograph as a determinant are used. This method is treated in 
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only @ minority of the texts on the subject;? it will be outlined very 
briefly here, using the present nomograph as an illustrative example, 
in order to indicate the general procedure. 

First, the standard equation for Rj.23 is written: 


rio? + ris? — 2rierisre3 
Ri.23 = / . (1) 


1 — 1237 








All terms must be collected on the left side; it is convenient to square 
and clear of fractions before transposing, however: 


Ri.23(1 — 123”) — Tie? — nis® + 2rieristes3 = 0. (2) 


Equation (2) must be written as a determinantal equation with 0 on the 
right side and a third-order determinant on the left. This is easily done 
in various Ways, e.g., 


1 — 2rieris 0 
Te3 +R(1 = 123”) 1 = 0, (3) 
0 nia? + 13" 1 


as may be verified by expanding the determinant. 
Asa crucial step, equation (3) must be rewritten in the following form: 


fu fx 1 
fa fe 1) =0, (4) 
fu fe 1 


with only two of the variables (in this case, correlation coefficients) 
appearing in the functions f;; found in any single row. This rearrange- 
ment is achieved (if at all) by the usual procedures in which multiples 
of one column or row of the determinant are added to another column 
or row. 

Finally, selected values of the variables (correlation coefficients) 
are chosen and the corresponding points (fu, fiz) are plotted in ordinary 
Cartesian coordinates. The points determined by a single value of one 
variable define a curve; such a curve is drawn for each convenient 
value of each variable. The resulting network of curves constitutes a 
curvilinear coordinate system, such as appear on both the left and the 
right sides of the accompanying nomograph. Three such networks 
must be drawn corresponding to (fu, fi), (fa, fez), and (fu, fs). These 


2A recent text is Allcock, H. J., and Jones, J. R., The Nomogram. (4th Ed. Revised by J. G. L- 
Michel.) London: Pitman, 1950. 
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constitute the nomograph. Ingenuity usually permits one or more of 
these networks to be collapsed into a single curve or straight line. In 
some extremely simple cases, the familiar but very specialized form of 
nomograph composed of three straight lines may result. In the present 

se, one network has degenerated into a straight-line scale, which has 
been made to coincide with an identical scale in one of the other net- 
works. 

If equation (1) can be expressed in the form given by equation (4) with 
the stated restrictions on the f’s, this fact is a necessary and sufficient 
condition for the existence of a nomograph of the general type dis- 
cussed here. If the result can be achieved at all, it can be achieved in 
a variety of ways, some of which are much more convenient for the 
user of the nomograph than others. No set routine can be laid down for 
getting blindly from equation (3) to equation (4) without violating the 
restrictions on the f’s, much less for obtaining an optimum result from 
the point of view of convenience to the user. 

The determinantal equation corresponding to the accompanying 
nomograph is 

0 1.873 1 
—R, -23 6 [r23(3 a 2Ri .23) + (1 _ 123") Ry 23° | 1 
1.5 9(ris? + 713”) 


| 1 
retis + 1 Tatis + 1 











APPLICATIONS OF CORRELATION MODELS 
FOR BISERIAL DATA 


R. F. Tate* 
University of Washington 
This article is addressed mainly to applied workers, and js 
devoted to giving as complete a picture as possible of the 
techniques and assumptions involved when correlations are 
estimated from biserial data. The exposition is nontechnical 
in the sense that mathematical proofs are omitted and numer. 
ous examples are given. The related theoretical material jg 
contained mainly in two recent articles by the author [11, 12]. 


I. INTRODUCTION 


HE usual model for correlation has been dealt with extensively by 

many authors, so a discussion of it per se will not be given. It does 
serve, however, as a good starting point for the present discussion, and 
will therefore be presented as Model A in the next section. The biserial 
and point-biserial models will then be introduced as alterations of 
Model A. In sections 3 and 4 the latter models will be treated individu- 
ally, together with illustrative examples. Section 5 will deal with com- 
parisons between the various models. Finally, Section 6 will close the 


discussion with observations on the validation of the assumptions in 
the biserial and point-biserial models and comments concerning which 
coefficient to use in a given experimental situation. 


II. CORRELATION MODELS 


Consider two continuous chance quantities X and Y, and let the 
parameter p be their population correlation. In order to speak of p no 
special assumptions need to be made about the joint probability dis- 
tribution of X and Y, except that the correlation exists. However, it is 
a fact that for the known estimation procedures and tests of signifi- 
cance associated with p we must require that X and Y have a bivariate 
normal distribution. In fact, normality assumptions will be made 
throughout this paper, and they are all motivated by the same need, 
namely that of a well-defined basis for making probability statements, 
which in addition is easy to work with. 


MODEL A 


(1) Chance quantities X and Y have the bivariate normal distribu- 
tion given by the bell-shaped surface 





* Research sponsored by the Navy Theoretical Statistics Project, Office of Naval Research. 
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1 
2QrotvV/1 — p* 


1 (x — pw)? p(x — -—y 'y — v)? 
exp - { E uw)? 2p — wy ),Y }} 


2(1 — p?) o oT 7? 


where p, v and o, 7 are respective means and standard deviations of 
X and Y, and p is the correlation between X and Y. 

(2) A random sample’ (X;, Y;), 7=1, 2, - - -, n, is chosen from the 
(X, Y) population. Some function of these observations is to be used to 


estimate p. 
The estimate which is customarily used is r=sxy/sxsy, where 





¥(2,y) = 








1 tee 1 
sx? = — bo (X; -_ X)?, sy? = — ; (Y; = Y)?, 
n 


n 


1 ai ae 
8xy = Ps > (X; — X)(Y; — Y). 


The quantity r is called the (Pearson) product moment correlation 
coefficient. 

In certain situations it is impossible to observe the Y values, and yet 
an estimate of p is still desired. If the Y distribution is observable in 
dichotomized form, such estimation is possible. A precise description of 
the situation would be the following. 


MODEL B 


(1) Chance quantities X and Y have the bivariate normal distribu- 
tion of Model A. 
(2) A chance quantity Z is defined by 


z={ (Y —»)/r2o 
a (Y — »)/r <a, 


where w is a real number known as the point of dichotomy of the Y 
distribution. 

(3) A dichotomized random sample (X;, Z;), i=1, 2, - - + , n, is ob- 
tained from the (X, Y) population. Some function of the X and Z ob- 
servations is to be used to estimate p. 

An estimate which is sometimes used is r,=8xz/sxX(ws), where 
My) =1/.\/2e exp (—y?/2), the normal density, and w is determined 
by looking up in the normal table the value of w for which 





1 Independence of observed couples is implicitly assumed in the phrase “random sample”. 
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io) 
f A(y)dy = Z. 
wW 


ry is called the biserial correlation coefficient. 

Notice that in the definition of Z the values of Y enter in the form of 
standard deviations from the mean. No generality will be lost, then, by 
choosing y= 0 and r=1. We accordingly make this assumption through- 
out the paper. The distribution of r, contains four unknown parameters, 
u, ¢, w, p. In Section 4 properties of r, and the maximum likelihood 
estimate j for Model B will be described. 

Now let us go one step farther and drop the assumptions on FY alto- 
gether, dealing entirely with X and Z. 


MODEL PB 


(1) The chance quantity Z takes values 0 and 1, and has the dis- 
tribution given by 


gz)=p(l—p)'* 2=0,1; O<p<i. 
(2) The chance quantity X has the conditional distribution* 


f(z|z) = = em (2-H) /267 


o us 


When Z=z. 

(3) A random sample (X;, Z,;), t=1, 2, -- -, n, is chosen from the 
(X, Z) population. Some function of these observations is to be used 
to estimate p,, the correlation between X and Z. 

Again, the customary estimate is r,, = 8xz/8x8z. 7» is called the point- 
biserial correlation coefficient. ) 

Considered as functions of observations, r and r,, have the same 
functional form. It should be remembered, however, that these two 
coefficients have entirely different probability distributions, since the 
models underlying them are quite different. 

It is more convenient to discuss Model PB first, since some of the 
properties will be useful later when we consider Model B. 


III. THE POINT-BISERIAL MODEL 


It was pointed out above that r and r, have the same functional 
form. It follows, therefore, that certain well-known properties of r are 
shared by r,,: namely, 





? Note that X itself isin general not normal, but is instead a mixture of N(u:, o%) and N(ue, o*) in 
the proportions p and g. 
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(i) ry» takes some value in the closed interval [—1, +1] for each 
possible random sample (Xi, Zs); 

(ii) rp» remains unchanged if the observations are coded; that is, we 
can write X;/=aX;+b, and r» for the random samples (X;, Z;) and 
(Xi, Z;) will be the same; 

(iii) r» = +1 if and only if the two chance quantities are linearly re- 
lated. Since Z is always 0 or 1, this means that the X observations 
would have to be concentrated at two points. The probability of such an 
occurrence is 0, but of course sample r,, values of +1 can arise; in fact, 
r» is a continuous chance quantity (even though Z is discrete), 80 any 
particular value has 0 probability and all values in [—1, +1] are possi- 
ble as was asserted in (i). It is not possible, of course, for p; to take 
values +1. This is prohibited by the model. 

With the aid of some additional notation we can express 7, in a 
form which is probably more familiar to some readers. 


Ni= >.Z;, the number of Z observations with value 1. 
No=n— >,Z,, the number of Z observations with value 0. 
p = N 1/ n 
q= N o/ n 
X,=the average of the X values for the Z’s which are 1. 
X)=the average of the X values for the Z’s which are 0. 
X =the average of all X values. 
ry can then be expressed as 


Tp = 


It is an algebraic fact that 


No N1 
nsx? = >> (Xox — Xo)? + D (Xi; — Mi)? + nfG(Xi — Xo)? 


i=l j=l 


It should be emphasized that each of the new symbols represents a 
chance quantity. If one thinks of the Z values as being fixed numbers, 
with n; ones and mo zeros, 7,» still has a meaning, but p; is no longer a 
correlation coefficient, since the possibility of correlation has been re- 
moved from the problem. 

If we let the standardized difference of means (4:—j0)/o be denoted 
by A, then p; can be written explicitly in terms of the parameters A and 


p (see [11]) as 
a a4/ p(l — p) 
1 + p(l — p)A? 
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In connection with this expression for p; we should recall that Model 
PB specifies that the two conditional distributions of X, for Z=0 and 
Z=1 respectively, are normal. A is thus unrestricted and may take 
any value in (— ©, +). It is then apparent that, regardless of the 
value of p (0<p<1), p may take any value in the (open) intervya| 
(—1, +1). The (unconditional) distribution of X may itself be normal, 
A condition which is both necessary and sufficient for X to be N(u, 7%) 
is that 4.=yo=yu, o?=7?. This will come up again in Section 6. 

The joint probability distribution of X;, X2, ---, Xn, Z1,Z2, +--+ ,Z, 
may be expressed as 


; bh. 1 Xe ; 
f(x, +6, Bas My, +, Zn) - (=) = -— toate (om — mh 


1 N1 
exp — {5 Dd (ay — mw) p(1 — p)”, 
20? j=l 
where n= >.2;, Nx=n—nz, and the zo; and 2; represent the subdivision 
of z values according to the corresponding z values. 
For maximum likelihood estimates, we obtain, after the usual differ- 
entiations, 





fo = Xo, ii = X, pb=Ni/n, ¢ = Vsx? — bg Xi — Xo). 


Substituting these estimates in the formula for p; gives r,, and thus 
verifies that r,, is a maximum likelihood estimate of py. 

The exact distribution for r,, is useful in some applications when 
small samples are considered. In general, though, for large samples the 
additional accuracy obtained by exact methods is not worth the extra 
labor. In order to use that exact distribution in the important applica- 
tions it is still necessary to indulge in a small approximation. This will 
be made clear in connection with an example. In what follows we will 
describe the “exact” method, illustrate and then consider large sample 
approximation procedures. 

Let us consider the statistic® 


T = rpV(n — 2)/(1 — rpp?). 


If we knew the distribution of 7, we could if necessary find the dis- 
tribution of r,, from it. It is actually unnecessary for applications to 
know the distribution of r,,; that of 7 will suffice. Moreover, the dis- 
tribution of T is not difficult to arrive at. It turns out to be a combina- 
tion (mixture) of non-central ¢ distributions. 








* This transformation was introduced by Lev [6]. 





CORRELATION FOR BISERIAL DATA 1083 


A non-central ¢ distribution‘ contains two parameters, the number 
of degrees of freedom d and the parameter of non-centrality 6. In our 
situation there will be a parameter of non-centrality 6(n:) for each 

ible value of m; (see [11]): 
Let ¢(t, 6(m1)) be a non-central ¢ distribution with d=n—2 and 


m(n — m) 
5(m) = af . 


Then 7 has the distribution 


n 


é(t) = > (") enc — p)r-™¢(t, 6(m)). 


n,=0 


Note that ¢(/) contains the unknown parameters p and A, and that if it 
is desired A may be reexpressed in terms of p and py. 

An important type of problem which may be handled with “exact- 
ness” by the distribution of 7’ is that of determining whether r,, is 
significantly different from 0. In other words, we can test the hy- 
pothesis H:p,=0. When hypothesis H is true, then p,=A=0 and é(¢) 
becomes the ordinary ¢ distribution with n—2 degrees of freedom (see 
{11] and Lev [6]). If the level of significance a is fixed, then the two- 
sample ¢ test is: reject H when | T| 2k, where ka is obtained from the 
ttable. The power function of this test, using level of significance a, is 


n n +ke 
B(p, pr) = 1 — >(")pna = £(t, (m1) )dt. 


n,=0 ke 


Now, if p is specified and p, is specified, we can use the table of Neyman 
and Tokarska [8] (provided a=.02 or .10) to calculate B(p, pi), the 
probability that the test would reject the null hypothesis under these 
conditions. We must express 6(m;) in the form of 


avnm(n — m)/np(l — p)(1 — pi’). 


Example 1: Let X be the total income of a household; let Z be 1 if 
at least one member of the household attends the theater during a given 
week and 0 otherwise. Suppose that 10 households are surveyed with 
the result that r,,= +.6624. At the .10 level of significance do we con- 
clude there is a connection between total income and theater enter- 
tainment? What is the probability of making the right decision if in 
reality p=} and pp=2/./2=+.707? 


‘ Johnson and Welch [5] have discussed this distribution at length with tabies and examples. 
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d=n—2=8, a = .10, ka = 1.860 
T = ryoV/(n — 2)/(1 — rps?) = 2.501. 





The answer to the first question is therefore yes. To answer the second 
question we construct Table I, filling in column (4) from Table I of 
Neyman and Tokarska [8]. We are considering a two-sample ¢-test with 
level of significance a=.10. Their table is constructed for a one-sided 
t-test with significance level a=.05. It is current practice to use their 
table, although small errors will creep in, due to the fact that the non- 
central ¢ distribution is skewed. The error of any entry in column (4) 
will be smaller for smaller entries (see p. 326 [8]). It can be seen from 
our Table I that large entries in column (4) will in general be multiplied 
by small entries in column (2) in the sort of problem we are discussing, 
Therefore, if p is not close to 0 or 1, we should expect good accuracy, 
Also, the first entry, .900, is exact (the Neyman-Tokarska table would 
have yielded a slightly higher value), so no error has arisen here. The 
probability of making a type II error when p=}, p:=1/+/2 is the sum 
of the last column. Therefore, the answer to the second question is 


1 1 
(= —) = 1 — P(Type[] error) = .832. 


3 V2 


For large samples the following approximation holds (see [11]): rp 
is an approximately normally distributed chance quantity with mean p, 


TABLE 1 








Col (2) X 
Col 4 


3 


10 oe the 
( aoa 5(m) = V/.45m1(10 —m) fe o(t, 6(m))dt 


m1 





-0173 -900 -016 
-0867 425 037 
-1951 : -214 .042 
.2601 ‘ -129 .034 
-2276 ‘ -090 -020 
. 1366 ° -083 O11 


aor WN © 


-0569 ‘ -090 -005 
-0163 ‘ -129 -002 
-0030 ‘ .214 -001 
-0003 ° -425 -000 
-000 -950 -000 


— ii? a i: 


— 
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and standard deviation 





pi? + 2p(1 — p)(2 — 3p’) 

4np(1 — p) 
When values for p and p; are unknown and we want g,,,, we have no 
alternative but to use the approximate values # and rp». 

Example 2: Let X be the IQ of an individual. Let Z be 1 if the indi- 
vidual votes at a given presidential election and 0 if he does not. As- 
sume that the people under consideration are citizens with a permanent 
residence in some state. Consider a sample of 100 individuals in which 
65 voted and 35 did not vote, and for which r,,=+.40. Test the hy- 
pothesis that p:=} at significance level .01, and find 99% confidence 
limits for p1. 

n = 100, nm, = 65, no = 35, a= Oi 


_ 84 (16 + 455(1.52) 
*'» = 99 2275 


From the normal distribution we obtain the figure 2.58, so 99% confi- 
dence limits for p, are .40+2.580,, or (+.19, +.61). To test the hy- 
pothesis p; =} at the .01 level we observe that .25 lies in the confidence 
interval, so we accept. 

If the value of p is known, a definite improvement can be made on 
the above procedure, even for samples only moderately large. This ob- 
servation stems from the fact that if p is known, the standard deviation 
contains only the parameter »:. By a technique first used by R. A. 
Fisher ({3], pp. 197-201), and known as a “variance stabilizing transfor- 
mation,” we can transform r,, and obtain a new statistic whose large- 
sample standard deviation depends only on the sample size. Such trans- 
formations will exist for most fixed values of p. We give the transforma- 
tion for the case p=} for the following reasons. First, situations in 
which the characteristic measured by Z is sex are among those which 
most clearly can be treated with Model PB, and p would in such cases 
represent the proportion of males or females in the population. Second- 
ly, the large-sample standard deviation ¢o,,,’ takes on its smallest possi- 
ble value (when p; is fixed) at p=}. For this case the transformed sta- 
tistic is given by® 





or, = (1 _ pi”) 








= .0813. 





1+ Vi— (= rp) 
Pare es ar 








1 
¢(rpo) = sgn (rp») ry log. 





5 sgn (rpp) is —1 for rps <0, 0 for rp» =0, and +1 for rp, >0. 
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The result we use, then, is that the chance quantity $(rp) ts approzi- 
mately normally distributed with mean $(p:) and standard deviation \/2]n 
when p=} and n is moderately large, say 25. 

Example 3: Suppose two independent samples, containing 25 and 
30 pairs of observations respectively, result in values rp.“ = +.20 and 
rp»? = —.10. Assume p™ = p® =4 for the two populations. Is there 
ample evidence at significance level a=.05 to accept the hypothesis 
that the two correlation coefficients p; and p; are equal? Using the 
symbols F, and F; for the two required values of Fisher’s z, we have 
that 


sgn (rps) Fi = sgn (rps) Fe 


is approximately normal with mean 0 and standard deviation 
V 2/25+2/30 =.383 when the hypothesis is true. 
From Table VIII of Pearson [9], we obtain 


V1 — (1 — .20?)? = /.0784, V1 — (1 — .10?)? = 7.0199. 
The square roots are respectively .280 and .141. Using Table VB of 
Fisher [3], we have 

sgn (+.20)(.288) — sgn (—.10)(.142) = + .430. 


From the normal table we obtain 1.96 for a=.05. .430 <1.96 (.383), so 
we accept the hypothesis. 

Example 4: For an example illustrating the use of the transformed 
statistic ¢(r,») for calculating confidence limits for p; see [11]. The data 
there are for the situation X =IQ and Z=1 for male, 0 for female. 








IV. THE BISERIAL MODEL 


Since Pearson used the “method of consistency” to develop 7», it is 
certainly a consistent estimate of p; that is, the probability that r, will 
deviate from p by any preassigned amount becomes small as the sample 
size increases. 

In connection with Model PB it was noted that r for Model A and 
rp» for Model PB had the same functional form. They were used with 
different assumptions on the sample, and so possessed different prob- 
ability distributions. In connection with Model B we will need to speak 
of the sample product moment correlation coefficient for the sample 
(X;, Z;). In order to be consistent with the above convention we intro- 
duce for this statistic a new symbol, r’. Corresponding to r’ we have 
the population parameter p’ = Corr (X, Z). r’ and p’ should not be con- 
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fused with r», and m. Now, 7 has an alternate expression, similar to 
that for r» and using r’: namely, 
Xi — Xo p4 8z 


ne ae 


= 7 . 
8x d(ws) A(wo) 


It has been known for some time that r, may exceed 1 in magnitude 
for some samples, although this alone does not prevent the statistic 
from being used, because such an event rarely occurs. There follows a 
short discussion of the magnitude of rp. 

First, there is the question of what happens in the limit® as n>. 
Population parameters p and p’ are related as 


, VPC — p)- 
A(w) 


It can be shown (see [12]) that both / p(1—p)/d(w) and its sample 
counterpart sz/A(w) are never less than +/x/2. Thus, since —1<p 
<+l, 





% = 


—V2/4 <p! < + V2/n. 


However, in the case of r, and r’ we see that for any fixed sample size 
-—1<r’'<+1 (the reasoning is the same as for the case of r). Thus, 


|rs| <1 when |7’| < V2/n 
rm <—1 when Pr < — V2/n 
ty> +1 when r’ > + V2/z. 


There is, then, an essential difference between the limiting case and 
the fixed sample case. The fact worth noting is that not only do we 
have, for certain sample sizes, samples for which r,>1, but the prob- 
abilities will be positive: In order to clear up this point the following 
illustration is offered. 

Example 5: Select arbitrarily a number greater than 1, say 4. Let 
n=1000 and p, w, p be subject only to their defining assumptions, 
0<p<l, —x~<w<+o, and —1<p<+1. The probability that the 
chance quantity # takes the value .999 is 1000 p°*°q. This value yields 
8z/(w») = 9.6. At the same time it can be seen, subject to the condition 
p=.999, that we have P(r’>.417| $=.999) positive. Multiplication of 
17 and 9.6 shows that P(r,>4) is positive. 


* What is referred to here is the stochastic limit, or limit in probability. The point is that rp and r’ 
are consistent estimates for p and p’ respectively. 
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It is obvious that the result of Example 5 holds for any (fixed) n 
greater than 1000, although the positive probability, P(r.>4), de 
creases with increasing n. This is consistent with the fact that in the 
limit p must be <1 according to the model. The reason a large n is used 
for the example is that we need a positive probability for to be near | 
(and yet not equal to 1)). 

Soper [10] derived the large-sample standard deviation for r. His 
expression is 


1 prelate, Ope)—Ne 37), plo)ate) 
m= wre | a) Nw) =|+ (a) 








where as before \(w) =1/+/2x exp (—w*/2), p(w) =P(Y 2w) =P(Z=1), 
and q(w)=1—p(w). For a table of o,,./n see Table I of [12]. 

It may be shown (see [12], Theorem X) that o,, takes on its smallest 
value (for p fixed) at p(w) =4, that is when w=0. This may also be con- 
firmed by a glance at the table just mentioned above. 

As the reader may have supposed, it may be shown (see [12], Theo- 
rem VI) that for large samples r, is an approximately normally dis- 
tributed chance quantity with mean p and standard deviation given by a, 

Again we have a situation with too many parameters, namely w and 
p. If we make the substitutions p=r, and w= in ¢,, approximate an- 
swers may be obtained as in the case of r,,, Example 2. 

Example 6: Let X be the score of an individual on a comprehensive 
test in mathematics which is given at the end of 4 years of college. Let 
Z be 1 if a certain question on a freshman mathematical aptitude test is 
answered correctly and 0 if it is not. Suppose that a sample of 200 stu- 
dents gives the results n; = 84, no = 116, m = —.20. Using level of signifi- 
cance a=.01, do we have reason to believe that this particular test 
item should be omitted from the battery of test questions? 

The situation described calls for a test of H:p=0. Using the data 
n=200, =.42, r»= —.20 and Table I of [12], we obtain 


1.224 0866 
or, — 4/200 — < . 


| _ .20| <2.58 o,,=.222, so we accept hypothesis H and assert that the 
item should not be used. 

Due to the minimal property of o,, at w=0 our attention is again 
focused on this case. For Model B we obtain the variance stabilizing 
transformation 
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Wn) = = Hoe, 1 + (2r/V/5) 





1 — (2r/+/5) 


and it can be said that (rs) ts approximately normally distributed with 
mean y(p) and standard deviation »/5/4n. 
Example 8: Using the data and conditions of Example 7 we will use 
the transformation ¥(r,) together with the assumption that p=}. 
From Table VB of Fisher [3] we obtain 


1 1 — .1789 1 1 + .1789 
— log + = so. sae oe ee — .181. 


gq "1—-(-179) 2  1-—.1788 





| —.181| <2.58\/5/4n=.204, and this time we have the same conclu- 
sion, accept H:p=0. 

The case of 7, yielding an impossible value of p could be taken care 
of, after a fashion, by truncating r, at +1. A much more serious defect 
is that 7, lacks satisfactory large-sample efficiency. That is, its large- 
sample standard deviation is, for certain values of p, much larger than 
o; the large-sample standard deviation of the maximum likelihood 
estimate ‘p. In order to express results more precisely let us denote the 
efficiency of r, by 

07? 
e(w, p) = : . 
or, 
The following facts concerning e(w, p) may be demonstrated (see [12], 
Theorems VII, VIII): 


(1) e(w, 0) = 1 
(2) e(w, —1) = e(w, +1) = 0. 


The facts are, then, that from the point of view of large-sample effi- 
ciency, 7 is about as good as @ if p happens to be near 0, but if p is near 
+1, p is definitely preferable as an estimate of p.’ Since we generally 
don’t have advance information on p, we should use p whenever time 
and money permit us to compute it. 

The above considerations bring us to the question of how to compute 
p. This is conceptually easy, but in practice requires a fair amount 
of calculation. No closed form exists for the maximum likelihood esti- 
mates @ and #, so we resort to an iterative procedure. The method to 
be used is Newton’s method in two variables. The likelihood equations 





7% is thus much superior to rp in item evaluation in those cases in which large values of \el appear. 
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in p, o, w, p can be reduced (see [12], Section 6) to 
f=% @=8, > (22-1)6:=0, Do 2/(22:- 14; =0, 


where 





(= — 1l)\(w -— a) 
$ fine ’ 
with @ defined by ¢(x)=X(z)/p(x). In this connection recall from 
Section 2 that p(x) = f,*d(y)dy. 


ITERATIVE METHOD 


(1) For initial guesses of & and @ use w and rz (unless |7,| >1, in 
which case see Remark 1 below) 
(2) Compute the following for each i=1, 2, ---, n. 


wy — Tor,’ 
Vian 
o: — b7;, 5:2,'$i, A; = oi(¢i — 5), 
A ix’, A a", 


2,', 6; = 22; — 1, vi o; = $(5:7;), 


Values of ¢; may be obtained from the table of Birnbaum and Hartley 
[2]. 

(3) Evaluate the three determinants 
> A: rw, >, Ai — > Aw’ 
z. A,’ ToWd 7 Ay; —_ >. A,z,'? 
D - >> 5; Tews >A - > Ai,’ 

. — > bit'b: reo > At’ — D> Asai? 
bs A; ~ > 5:9; 
Zz A,z;’ -_ yt 5:25: 


(4) The first stage will result in values 


dD; 





Ds 





Be? tae 
Om te we a 


p2 = 1 +f Dri iiles rp?)3/?, 
dD, 


Repeat the whole process, if necessary, using we, p2 in place of ws, 7. 
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Remarks: 
(1) Ifm<—1 orm>-+1, we can use —.90 or +.90 respectively as a 


first guess for p, together with w, as a first guess for 6. Other possibilities 
for p, Which lead to convenient calculations, are \/.91 and /.96. 

(2) When 7 is near +1, the calculations will of course be more diffi- 
cult. In this case 2 or 3 stages of the procedure may be necessary. 
When 7’ is moderate in size, 1 stage will often be sufficient. 

Example 9: An example using 20 observations taken from a bivariate 
normal population with p=1/,/2, and resulting in r,=+.410, is given 
in Section 6 of [12]. One stage is required to produce the answer 
9=+.489 which is correct in the third decimal. 

Example 10: As an example of a difficult case consider the following 
data (Again we may think of X as IQ with Z denoting pass or fail): 


X 90 96 100 107 110 
Z 0 0 1 1 1 


Initial calculations yield w= —.256, = 100.6, s.=7.26, r’ =.8553. Re- 
calling that r,2r’\/x/2, we see 21.072. Therefore, assuming 17» 
=+.90 as an initial guess, together with w, = —.256 we could start the 
procedure. It turns out in this case that the resulting values of stage 
1 are w= —.3814 and p2=+1.025. This indicates that a larger guess 
on p is called for. Accordingly, choose p=1+/.96 as a next guess and 
w= —.3814. The following are the calculations for stage 2: 

xi 3s vi oi 5:25" $i — 8:75 A; 
—1.460 —1 +5.246 .000001 +.000001 +5.246001 -000005 
—0.633 —1 +1.194 .221315 +.140092 +1.415315 -313230 
—0.083 +1 —1.500 .138790 —.011520 +1.638790 . 227448 
+0.881 +1 —6.223 0 0 +6 .223000 0 
+1.294 +1 —8.246 0 0 +8 .246000 0 





+ .128573 +540683 
-000010 
— .018878 


0 0 
0 0 





—.217160 +.127085 
D 6%; = — .082526 = — .021068 
pw. >, A; — D> Ava’ = + 015109 = — .001773 
pave >, Awa,’ — D> Aga’? = — .045033 — .051596 
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te aide 
@ =w+t+—vJ/l — p? = — .3646 
Di 


D; 
Ps = p + — (1 — p?)?/? = + 9994. 
Di 


The true value of p therefore lies between +/.96 = +.9798 and +.9994. 
Analytic expressions for 75? and o;? are available (see [12], Theorem 
I), but have not been tabulated. 
As for r, and r,,, we again find that ¢ is approximately normal for 
large samples and that o;? is smallest when w=0. 


V. COMPARISON BETWEEN MODELS 


Setting aside for the moment any questions we may have concerning 
the validity of the Models A, B, and PB in given experimental situa- 
tions, let us make all of the appropriate comparisons we can. 

(1) Since all coefficients are estimates of population correlations, 
they are measures of the degree of linear relationship, and not neces- 
sarily of general dependence. 

(2) Under the conditions of Models B and Model PB rn, 6 andr,, are 
all unchanged by coding of the X values. 

(3) r, p, and r,, are maximum likelihood estimates for p, p, and p; 
respectively in the case of Models A, B, and PB. In addition 1, is 
essentially as good as a maximum likelihood estimate when p=0. 

(4) For large samples o,,2 07 >0,. The first inequality holds because 
p is &@ maximum likelihood estimate. The second inequality holds be- 
cause Model B gives restricted information for the Y observations of 
Model A and r is maximum likelihood for Model A. 

(5) Nothing is gained by comparing ¢,, or o> with a, oy SINCE T, and p 
estimate one parameter and r,, estimates another. 

(6) 1, rp, and p are all restricted to the interval [—1, +1], whereas 
r, is unbounded. 

(7) on, a3, and o,,, all take their smaucst values (considering p and p; 
fixed) when p=}, that is when the Z observ. ‘ions take the value 1 on 
the average 50% of the time. 

(8) A variance stabilizing transformation is available for r, 7», rT», 
and Table VB of Fisher [3] can be used in all cases. 


VI. ASSUMPTIONS UNDERLYING MODELS B AND PB 


There is considerable controversy over the question of when to use 
“biserial r” and when to use “point-biserial r.” In terms of this paper 


the question becomes that of when to use Model B and when to use 
Model PB. 





CORRELATION FOR BISERIAL DATA 1093 


Assumptions concerning the X observations in Models B and PB 
may be examined by using known statistical tests. The matter of de- 
ciding whether or not the Z observations may be considered to stem 
from a dichotomy of a continuous distribution is, on the other hand, a 
fairly open question which depends for the most part on what is known 
about the data at hand, or what may be assumed by invoking theo- 
retical concepts of the particular field of application. Comments will be 
made below in connection with this problem. It is hoped, at least, that 
certain misconceptions may be removed from consideration. 

First consider Model PB. This is the most satisfying from the point 
of view of a mathematical statistician. The quantity r,, is a maximum 
likelihood estimate; it is easy to compute. Moreover, in Section 3 it 
was seen that clear cut estimation procedures exist, especially for the 
case p=4. In any situation which produces biserial data, we can use 
ordinary tests of normality to investigate the assumptions on the con- 
ditional distributions which arise in Model PB. From Section 2 the 
probability distribution of an X observation is 


1 1 
h(x) = — g—(2—m)*/20? 4 (y — — e—(2—49)*/20 
(x) P ta ( ay. - 
One method of testing goodness of fit to the distribution h(z) is to 
use normal probability paper.* This is done as follows. 


(1) Compute Zo, Z, s 

(2) Using probability paper plot 2 curves. For observations 2, 
Toa, * * * » Lon, Plot 1/no (Number of observations $2o;) against 
(xo;—4o)/s. For observations 211, 212, - + * , Lin, plot 1/m, (Num- 
ber of observations £2,) against (2;;—2%1)/s. 

(3) If both curves are approximately straight lines, conclude that 
Model PB is satisfactory. 


The above tells us only whether the normality conditions for Model 
PB are satisfied. It doesn’t indicate anything about whether or not p; is 
the quantity we should be interested in. 

In the case of Model B we compute , s, and apply the considerations 
of the previous paragraphs. There is, of course, the possibility that the 
conditions of both Model B and Model PB will be satisfied.* In such a 
case we would have p=p,=0. p; would be 0 since normality of X in 





* It has been shown in a recent article by Chernoff and Lehmann [1] that the chi-square statistic 
does not have the x? distribution for large samples, as was previously thought, if maximum likelihood 
estimates based on the full sample are used. In our problem the siate of affairs is that if we pick a sig- 
nificance level a =.05 and use chi-square we wrongly reject the distribution h(z) more than 5% of the 
time. 

* Notice that in this case p: of Model PB and p’ of Model B (introduced in our discussion of rz) 
coincide. 
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Model PB entails uo=;. Then in view of the discussion preceding 
Example 5, we would have p=piv/pq/A=0. 

In the case of Model B the following observation is pertinent. If we 
consider a random sample (X;, Z;) in which all X observations have the 
same normal distribution, there will always exist chance quantities Y, 
with properties: 

(i) The Y,’s are independent and each has a normal distribution 
with mean 0 and standard deviation 1; moreover, (X;, Y;) is bivariate 
normal for each 7=1, 2,---, m. 

(ii) There is always a point of dichotomy w such that the random 
sample (X;, Y;) may be dichotomized to produce the random sample 
(X;, Z;) with which we started. 

This is interesting, because it means that with Model B we are never 
operating in a vacuum. If the X observations are normal, estimation 
and testing procedures will be valid for some p. 

With regard to which model we should use one school of thought is 
the following. If one finds that the X observations are normal, and has 
reason to believe that the sample (X;, Z;) should be thought of as result- 
ing from a dichotomy of some continuous distribution, then he may use 
Model B and define the p of the previous paragraph to be a legitimate 
measure of the relationship between X and Y. This is essentially the 
view expressed by McNemar ([7], p. 173) in relation to item evaluation 
for psychological tests: 

“As a matter of fact, one can usually justify the use of the regular 
biserial r with obviously continuous variables by saying that the co- 
efficients obtained represent what we would expect the product mo- 
ment correlation to be if we had a measuring scale for the dichotomized 
trait which actually yielded a normal distribution. The bothersome 
question concerning the normality of the distribution of the trait—if 
and when measured in some hypothetical, unattainable, equal units— 
is thereby side-stepped. If the point nature of the dichotomized variable 
can really be demonstrated, then and only then can one justify using 
the point-biserial correlation coefficient.” 

The other school of thought, and the one to which this author sub- 
scribes, may be summarized as follows. One can actually observe 
(X;, Z;). It is possible that the observations may be thought of as the 
result of a dichotomy, but one cannot be sure. Relying on the ob- 
servable characteristics of the situation one could then always consider 
p; to be the measure of interest, unless definite reasons can be given for 
assuming a continuous, normal underlying distribution. In this con- 
nection we quote Guilford ([4], p. 330), again relating to item evalua- 
tion in psychological tests. 
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“Since the r,s; coefficient is not restricted to normal distribution in 
the dichotomous variable, it is much more generally applicable than is 
r,. Whenever there is doubt about computing 7, the point-biserial r will 
serve. For this reason it should probably be used more than it is.” 

In summary we offer the following recommendations. 

(1) If the normality conditions are satisfied for Model PB, then use 
Model PB if there is any doubt concerning the validity of Model B. 
Employ the approximation procedures of Section 3 whenever possible. 
If any choice is available, try to arrange for p to be approximately 3. 

(2) If Model B is deemed to be valid, then use 7, to test hypotheses 
of the form p=0, and use 4, whenever it is convenient, for other 


purposes. 

In connection with (2) it would be extremely useful to have og? and 
7;?in tabular form. We could then not only make confidence statements 
about p using p but could decide how many additional observations are 
required to bring the accuracy of r, up to that of p. 


REFERENCES 


[1] Chernoff, H., and Lehmann, E. L., “The use of maximum likelihood esti- 
mates in chi-square tests for goodness of fit,” Annals of Mathematical Sta- 
tistics, 25 (1954), 579-86. 

[2] Editorial, “The normal probability function: Tables of certain area-ordinate 
ratios and of their reciprocals,” Biometrika 42 (1955), 217-22. 

[3] Fisher, R. A., Statistical Methods for Research Workers. London: Oliver and 
Boyd, 1946. 

[4] Guilford, J. P., Fundamental Statistics in Psychology and Education. New 
York: McGraw-Hill Book Co., 1950. 

[5] Johnson, N. L., and Welch, B. L., “Applications of the non-central t-distri- 
bution,” Biometrika, 31 (1940), 362-89. 

[6] Lev, J., “The point biserial coefficient of correlation,” Annals of Mathe- 
matical Statistics, 20 (1949), 125-6. 

[7] McNemar, Q., Psychological Statistics. New York: John Wiley and Sons, 
1949. 

[8] Neyman, J., and Tokarska, B., “Errors of the second kind in testing 
‘Student’s’ hypothesis,” Journal of the American Statistical Association, 31 
(1936), 318-26. 

[9] Pearson, K., Tables for Statisticians and Biometricians. Part I, Cambridge: 
Cambridge University Press, 1945. 

[10] Soper, H. E., “On the probable error for the biserial expression for the corre- 
lation coefficient,” Biometrika, 10 (1913), 384-90. 

[11] Tate, R. F., “Correlation between a discrete and a continuous variable. 
Point-biserial correlation,” Annals of Mathematical Statistics, 25 (1954), 
603-7. 

[12] Tate, R. F., “The theory of correlation between two continuous variables 
when one is dichotomized,” Biometrika, 42 (1955), 205-16. 





A MULTIPLE COMPARISON PROCEDURE FOR COMPARING 
SEVERAL TREATMENTS WITH A CONTROL 


Cartes W. Dunnertr* 
American Cyanamid Company 


I, INTRODUCTION 


COMMON problem in applied research is the comparison of treat- 

ments with a control or standard. Such a situation may arise, for 
example, when an agronomist tests the effects on crop yield of the addi- 
tion of chemicals to the soil, or when a pharmacologist assays drug 
samples to determine their potencies. In designing an experiment to 
measure the effects of such treatments, it is often desirable to include 
in the experiment a control in the form of either a dummy treatment, 
to measure the magnitude of the experimental response in the absence 
of the treatments under investigation, or some recognized standard 
treatment. Sometimes past experience with the control will suffice, but 
often this cannot be relied upon due to altered environmental con- 
ditions. Thus the agronomist may leave a few of his experimental plots 
untreated for comparison with the treated plots, and the pharmacolo- 
gist may measure the response to a standard drug preparation of 
known potency concomitantly with the test samples in order to esti- 
mate the potencies of the latter. 

We will consider the case where the numerical results of an experi- 
ment performed to compare p treatments with a control can be sum- 
marized in the form of a set of numbers Xo, X1, - - -, Xp and s, where 
the X’s are means of »+1 sets of observations which are assumed to be 
independently and normally distributed, Xo referring to the control and 
X; to the i-th treatment (i=1, - - -, p), and s is an independent esti- 
mate of the common standard deviation of the p+1 sets of observa- 
tions. This paper presents a procedure for making confidence state- 
ments about the true (or expected) values of the p differences X;— Xo, 
the procedure having the property that the probability of all p state- 
ments being simultaneously correct is equal to a specified value, P. 
Tables have been computed which enable the procedure to be used by 
the experimenter for P=.95 or .99 and p=1(1)9. When the numbers 
of observations in each set are equal, the tables enable the experimenter 





* I should like to express my appreciation to Frank Wilcoxon for suggesting this problem and for 
his constant encouragement throughout the course of the investigation. I am grateful for this oppor- 
tunity to acknowledge also my indebtedness to Robert Bechhofer and to Milton Sobel. Were it not 
for my good fortune to be associated with them while working on an Air Force contract at Cornell 
University in 1952-3, this paper could not have been written. 

I am indebted to Robert E. Bechhofer also for making the tables in reference [10] available to me. 
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to set one-sided upper (or lower) confidence limits on the true values of 
the p differences X;— Xo such that the probability is P that all p true 
yalues Will actually be less than the upper limits set, or to set two-sided 
confidence limits on the true values of the p differences Xi— Xo such 
that P is a lower bound to the probability that all p true values will 
actually be between the limits set. If the numbers of observations in 
each set are unequal, the tables may still be used but the associated P 
will be only approximate. The tables may also be used to set joint 
confidence limits on the potencies of p drugs relative tou a common 
standard, the associated P being approximately equal to the prob- 
ability that all statements will be correct when one-sided limits are set 
and approximately a lower bound to this probability when two-sided 
limits are set. 

The problem of multiple comparisons with a control is a special case 
of the more general multiple comparisons problem considered by 
Tukey [17] and Scheffé [16]. Tukey’s procedure based on the Student- 
ized range and Scheffé’s procedure based on the F-distribution enable 
the experimenter to make any number of comparisons among a set of 
sample means with the assurance that the probability of all confidence 
statements being correct will be equal to or greater than a specified 
value. When the experimenter only wishes to make comparisons be- 
tween one of the means «nd each of the others, as in the case when one 
of the means represents a control, use of the Tukey or Scheffé pro- 
cedure would result in confidence limits which are wider than necessary. 
The procedure described in this paper results in narrower confidence 
limits for the p comparisons X;— Xo than either the Tukey or the 
Scheffé procedure. 

In an earlier paper, Roessler [15] considered the problem of multiple 
comparisons involving a control. However, he assumed that the p com- 
parisons X;— Xo were independent which is incorrect since they all 
have Xo in common. In the present paper it is shown that, to obtain 
simultaneous confidence limits on the X;— Xo, the multivariate ana- 
logue of Student’s ¢-distribution defined by Dunnett and Sobel [4] is 
encountered. This same distribution was involved in a multiple de- 
cision procedure for ranking population means described by Bechhofer 
et al., [2], to which Tables 1a and 1b of the present paper are applicable. 
A multiple decision procedure for comparing several experimental 
categories with a control was formulated by Paulson [11]. Tables la 
and 1b of the present paper are also applicable to Paulson’s procedure. 
The procedure described in the present paper may also be considered 
as a multiple decision procedure; it is compared with Paulson’s in 
Section VII below. 
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For the benefit of those who may be interested primarily in applica. 
tions, the procedure is illustrated by two examples in Section II. The 
main part of the theory is given in Section III with a description of the 
construction of the tables in Section IV. In Section V, the question of 
the optimum allocation of available experimental resources between 
the control and the p treatments is considered. In Section VI, the pro. 
cedure is applied to the problem of estimating the potencies of p drug 
samples relative to a common standard. 


II, EXAMPLES 


(a) The following example was adapted from one given by Villars 
[18]. The data represent measurements on the breaking strength of 
fabric treated by three different chemical processes compared with a 
standard method of manufacture. 








Breaking Strength (Ibs.) 





Standard Process 1 Process 2 Process 3 





55 55 55 50 
47 64 49 44 
48 64 52 41 
Means 50 61 52 45 
Variances 19 27 9 21 





Here, p=3 and N=3. The average variance is s?=19, which is an 
estimate of the common variance of the four sets with (p+1)(N —1) =8 
degrees of freedom. It could be calculated directly as follows: 


, _ 55? + 47? + 48? + 55? + - - - + 41? — 3(50? + 61? + 52? + 45%) 
= 8 





152 
— = 19. 
8 


The standard deviation is s=1/19=4.36 and the estimated standard 
error of a difference between two means is s\/2/N =4.36./2/3=3.56. 
The quantity which must be added to and/or subtracted from the 
observed differences between the means to give their confidence limits 
has been called by Tukey [17] an “allowance” and is given by A 
=ts\/2/N, where ¢ is obtained from Table 1 if one-sided limits are 
desired or from Table 2 if two-sided limits are wanted. For p=3 and 
d.f.=8, {=2.42 for one-sided limits and ¢=2.94 for two-sided limits for 
P=95%. Analogous values of ¢ can be determined from the tables if 
P=99% confidence is required. 
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For one-sided limits, the allowance is A =(2.42)(3.56)=9 and the 
experimenter can conclude that: 

(i) The breaking strength using process 1 exceeds the standard by 
at least 61—50—9=2 lbs. 

(ii) The breaking strength using process 2 exceeds the standard by 
at least 52—50—9= —7 lbs. 

(iii) The breaking strength using process 3 exceeds the standard by 

at least 45—50—9 = —14 lbs. 

The joint statement consisting of the above three conclusions has a 
confidence coefficient of 95%, i.e., in the long run, 95% of such joint 
statements will actually be correct. Upper limits for the three differ- 
ences could be obtained in an analogous manner. 

For two-sided limits, the allowance is A = (2.94) (3.56) =11 and the 
experimenter can conclude that: 

(i) The breaking strength using process 1 exceeds the standard by 
an amount between 61—50+11=0 and 22 lbs. 

(ii) The breaking strength using process 2 exceeds the standard by 

an amount between 52—50+11=-—9 and 13 lbs. 

(iii) The breaking strength using process 3 exceeds the standard by 

an amount between 45—50+11= —16 and 6 lbs. 

The joint confidence coefficient for these three statements is greater 
than 95%. (Due to an approximation made in computing Tables 2a 
and 2b, the tabulated values of ¢ are somewhat larger than necessary 
so that the actual P’s attained are slightly greater than 95 and 99%. 
No such approximation was made in computing Tables la and 1b.) 

(b) The following data are blood count measurements on three 
groups of animals, one of which served as a control while the other two 
were treated with two drugs. Due to accidental losses, the numbers of 
animals in the three groups are unequal. 








Blood Counts (millions of cells per cubic millimeter) 





Controls Drug A Drug B 





7.40 9. 12.80 
8.50 . 9.68 
7.20 ‘ 12.16 
8.24 9. 9.20 
9.84 10.55 
8.32 





49.50 . 54.39 
6 5 
8.25 ‘ 10.88 
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Computations: 
49.50? 35.60? 54.392 


7.40°+8.50?+ - - - +9.20?+ 10.55? — r ~~ ; 








(>) Ni) —(p+1) 
16.566 16.566 

~ 15-3 +12 
8 = 1.3805 = 1.175 


= 1.3805 





For d.f.=(>°N.)—(p+1) =12 and P=95%, t=2.11 (one-sided) or 
t=2.50 (two-sided). 
“Allowances” for differences from the control: 


One-sided: drug A: (2.11)(1.175)V/1/6 + 1/4 = 1.60 
drug B: (2.11)(1.175)+/1/6 + 1/5 = 1.50 
Two-sided: drug A: (2.50)(1.175)+/1/6 + 1/4 = 1.90 
drug B: (2.50) (1.175)./1/6 + 1/5 = 1.78 


If the experimenter is interested only in upper one-sided limits for the 

differences from the control, he can make the following statements: 
(i) Drug A raises the blood count by at most 8.90—8.25+1.60 
= 2.25 millions per cmm. 

(ii) Drug B raises the blood count by at iil 10.88 —8.25+-1.50 

=4,.13 millions per cmm. 

The joint confidence coefficient for these two statements is approxi- 
mately 95%. (Since the tables of ¢ were computed for equal numbers 
of observations per group, their use in the case of unequal numbers 
results in the desired probabilities being only approximately achieved.) 
Corresponding lower limits could be calculated in an analogous manner. 

If the experimenter desires simultaneous upper and lower limits on 
the differences, he should use the two-sided allowances as follows: 

(i) Drug A raises the blood count by an amount between 8.90 —8.25 

+1.90 = —0.25 and 2.55 millions per cmm. 

(ii) Drug B raises the blood count by an amount between 10.88 

—8.25+1.78=0.85 and 4.41 millions per cmm. 

An approximate lower bound to the joint confidence coefficient of 

these statements is 95%. 
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III. THEORETICAL BASIS 


Suppose there are available No observations on the control, N; ob- 
servations on the first treatment, - - -, N, observations on the p-th 
treatment. Denote these observations by X,; (1=0, 1,---, p; 7=1, 
9,---, N,), and the 7-th treatment mean, >, Xi/Ni, by X;. We 
make the assumptions usually made in the analysis of variance, namely, 
that the X,; are independent and normally distributed with common 
variance o? and means m;. We assume also that there is available an 
estimate s? of o?, independent of the X;, which is based on n degrees of 
freedom. For example, we may take 


p Ni 
v= DOD (Xs — ¥)*/n (1) 
i=0 j=l 
where n=(>-?.9 N;)—(p+1). Our problem is to obtain separate con- 
fidence limits for each of the differences m;— mp (t=1, 2, - - - , p), such 
that the joint confidence coefficient, i.e., the probability that all p 
confidence intervals will contain the corresponding m;— mp, is equal to 
a preassigned value, P (0<P<1). 

Consider first the case p=1, where there is only one treatment to be 
compared with the control. The method of obtaining confidence limits 
for m,—m., as described in almost any statistics textbook, is based on 
Student’s {-distribution. If we write 
X1 — Xo — (m: — mo) 

1 + 1 
Ni No 


which is normally distributed with mean 0 and variance o?, then 
t=z/s follows the Student ¢-distribution with n degrees of freedom. A 
lower limit on m,—m, with the desired confidence coefficient will be 


given by 





z= 


ite 1 1 
X1 — Xo — d's witty, 
if d’ is chosen so that 
Prob (¢ < d’) = P (2) 
Similarly, if an upper confidence limit is required, it will be given by 


Xi1—Xot+d’ e.. 
1 0 8 N, No 
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On the other hand, if the experimenter wishes to have bounds on 
m,— mz, in both directions, he may take 


where d”’ is chosen to satisfy 
Prob (|t| <d”) =P (3) 


The constant d’ or d’’ corresponding to the desired value of P can be 
obtained from tables of the percentage points of Student’s ¢-distribu- 
tion, which are widely available. 

Now consider the general case where there are p treatments and a 
control. Write 





and t;=z,;/s, i=1, 2,---, p. Then lower confidence limits with 
joint confidence coefficient P for the p treatment effects m;—my will 
be given by 


a a 
X:— Xe—d/s4/— + —, ¢=1,2,---,p), 
’ Ne. P) 


if the p constants d,’ are chosen so that 
Prob (t; < dy’, te < d,’, *)@ °,& < d,’) = P. 
Similarly, upper confidence limits will be given by 


On the other hand, two-sided confidence limits having the desired joint 
confidence coefficient will be given by 


ae: al 1 1. 
X.— Kot diteg/— +H, & = 1,2,---,p), 
if the p constants d;’’ are chosen to satisfy 


Prob (|t:| < di”, |t2| <ds”’,---,|t,| <d,”) =P. (5) 
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To find any set of constants d,’ or d;’’ satisfying these equations, the 
joint distribution of the ¢; is required. It may be noted that the joint 
distribution of the z; is a multivariate normal distribution with means 
0 and variances o” and where the correlation between z; and z; is given 


by 


The joint distribution of the ¢; is thus the multivariate analogue of 
Student’s é-distribution defined by Dunnett and Sobel [4]. 

To find solutions to (4) and (5), we require a tabulation of the multi- 
variate Student ¢-distribution. We shall show now that the problem of 
tabulating the multivariate Student ¢-distribution can be reduced to the 
problem of tabulating the corresponding multivariate normal distribu- 
tion. Consider first equation (4) above. It can be written 


P = Prob (a < dj's, 22 < de’s, ++ +, Zp < d,’s) 





, , , (6) 
F(di’s, do's, - + - , dy’s)p(s)ds 


where F (z;, z2, - - - , 2p) is the multivariate normal c.d.f. of the z; and 
p(s) is the probability density function of s. Thus, if F were tabulated, 
it would be fairly easy using a desk calculator to evaluate (6) by nu- 
merical integration for any set of fixed d;’, and hence to find solutions 
to (4) as functions of P, p and n. 

Similarly, (5) can be written 


P = Prob(|a| < dis, | 22| <de”s,---, |zp>| < d,”s) 


“ (7) 
= G(d,'"s, de’’s, - - - , dy’’s)p(s)ds 


—o 


where G (21, 22, - - - , Zp) is the c.d.f. of the |z,;|. Again, if G were tabu- 
lated, we could also evaluate (7) and determine the solutions to (5) as 
functions of P, p and n. 

The functions F and G can be obtained for p=2 from K. Pearson’s 
tables of the bivariate normal distribution [13], although the tabulation 
interval is not fine enough for numerical integration purposes. How- 
ever, for p=2, (4) and (5) can be evaluated directly from the results of 
Dunnett and Sobel [4]. The function F has been tabulated for equal 
values of its arguments and for p $9 by the National Bureau of Stand- 
ards [10] for the special case p;;= 1/2. As will be explained in the next 
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section, Table 1 of this paper was based on this tabulation. It would be 
extremely useful for the purpose of the problem considered in this paper 
to have a corresponding tabulation of the function G. 

Until exact tables are available, it will be necessary in general to rely 
upon approximations to the solutions of (4) and (5). From the results 
given by Dunnett and Sobel [5], we can write, when 


mo / YEN) 


P 
Prob (th < di’,  < dh’, ++ +,tp <d,’) 2 [] Prob(t <d,’) @) 


t=1 





Prob (|t:| < di”, || <d”’,---, |t,| <,”) 


> [J Prob (|t| < d,”’). ) 
t=1 

Thus, upper bounds to the constants d,’ and d,’’ can be determined 
by equating the right-hand sides of (8) and (9) to the desired value of 
P. These calculations involve only the probability integral of the uni- 
variate Student é-distribution, which has been tabulated most recently 
by Hartley and Pearson [8, 12]. 

Since (4) and (5) are each increasing functions of the correlations 
p:j, alternative lower bounds which are closer than the lower bounds 
given in (8) and (9) may be obtained by taking p,;=0. Pillai and 
Ramachandran [14] have tabulated solutions d,’=d,’= - - - d,’ to (4) 
and solutions d,’’=d,’’= - - - =d,’’ to (5) for P=.95 and p,;=0. It 
would be useful to have a tabulation of such equicoordinate percentage 
points of the multivariate Student ¢-distribution in the important spe- 
cial case where p;;=0 for other values of P. 

Lower bounds based on the bivariate Student é-distribution can also 
be obtained, as shown in [5]. Taking d,’=d,’= --- =d,’=d’, d,’’ 


=d,/’= --- =d,’’=d"’, and pi;=p20, these can be written 
Prob (th < d’, tp <d’,---,t, <d’| pi; = p) (10) 
> [Prob (th < d’, ts < d’| ps = p) |?! 


and 
Prob (|| <d”, |4&| <d”,---, |t,| <@”| py = p) 


11 
> [Prob (|u| <a”, || <d’| on = py} 
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The probability within the square brackets can be determined from 
the probability integral of the bivariate Student-¢ distribution [4] with 
correlation p. The bounds obtained from these inequalities are sharper 
than those given by (8) and (9), and in most cases will also be sharper 
than those obtained by using the Pillai and Ramachandran tables [14]. 

There are, of course, infinitely many solutions to (4) and (5). For the 
applications considered in this paper, we will take the constants to be 
equal, viz., di:’=d,’= +--+ =d,’ (=d’, say) and d;’’=d;"’= --- =d,”’ 
(=d’’, say). Besides greatly simplifying the computational problem, 
there are some theoretical grounds for doing so in the case where 
pj=p20, which occurs frequently in practice. For example, it can be 
shown that >> d,;’? and >> d;’”? are each minimized by this choice. 


IV. CONSTRUCTION OF THE TABLES 


Tables 1a and 1b give solutions d’=d,’= - - - =d,’ to (4) for P=.95 
and .99, respectively, for p<9 and p;;=1/2. They are applicable to the 
situation where there are equal numbers of observations on the control 
and the p treatments, viz, No=Ni= - - - =N,. These tables were con- 
structed by numerical evaluation of the integral in (6) using tables of 
the function F computed by the National Bureau of Standards [10]. 
The method used was as follows: For n=5, 10, 20, 25 degrees of free- 
dom, the integral in (6) was calculated for three successive values of d’ 
differing by 0.1 such that the desired value of P was bracketed. The 
required value of d’ was then determined to 3 decimal places by 3-point 
inverse interpolation. For n= © degrees of freedom, the values given 
in tables computed by Bechhofer [1] were used. For the intermediate 
degrees of freedom, the values were obtained by interpolation with 1/n 
as argument. An accuracy of 1 in the second decimal place should be 
achieved by this method. 

Tables 2a and 2b give solutions d’’=d,'’= - - - =d,’’ to the right- 
hand side of (11) for P=.95 and .99, respectively, for p<9 and p,;=1/2. 
The method of constructing these tables was as follows: For n=5, 10, 
20, 40, «» degrees of freedom, the probability in the square brackets of 
(11) was calculated for three successive values of d’’ differing by 0.1 
such that the desired value of P was bracketed, using the expressions 
developed in [4] for the probability integral of the bivariate Student 
t-distribution. The required value of d’’ was then determined by 3-point 
inverse interpolation to 3 decimal places. For the intermediate degrees 
of freedom, the required values were obtained by interpolation using 
1/n as argument. An accuracy of 1 in the second decimal place should 
be achieved by this method. 
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V. OPTIMUM ALLOCATION OF OBSERVATIONS BETWEEN 
CONTROL AND TREATMENTS 


The tables given in this paper were prepared to handle the case where 
equal numbers of observations are available on each of the p treatments 
and on the control. In many practical situations, the experimenter will, 
in fact, wish to allocate the available number of observations equally 
to each group. Where it is feasible, however, it may be advantageous 
to do otherwise. In this section, we will consider the consequences of 
allocating No observations to the control and N, observations to each of 
the other treatment groups, i.e., we will take Ni=N2= -- - =N, and 
No#¥N,. 

Lower confidence limits to the m;— mp will be given by 

—— : 1 1 ; 
X; — Xo — d's W.* We (¢=1,2,---,p), 
where d’ is chosen to satisfy 
Prob (¢; < d’,i = 1,2,---,p) =P, 


the ¢; having the multivariate Student ¢-distribution with 


- 1/ (+1) 
a M, ° 


We will consider the allocation No/N; optimum if it maximizes P for 
fixed value of 


a! 1 + 1 
Ni No 
and fixed total number of observations, No+pN,. It may be noted that 
fixing the total number of observations also fixes n, the number of de- 
grees of freedom associated with the multivariate ¢-distribution, if s is 
defined by (1). Let 


Then (12) can be written 





Prob | t; < 


a Be 
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From (13), it is evident that P can be increased for fixed h by de- 
creasing 


1 i 
Ni No 


It is easy to show, as has been pointed out by several authors, see, for 
example, Finney [7], that 

; a 

N, ™ No 
attains a minimum for fixed No+pN; when No/Ni=+/p. However, this 
choice makes p;;<1/2, which operates to decrease P. 

In order to investigate numerically the effect of different-allocations 
on P, the curves shown in Figs. 1 and 2 computed. Fig. 1 shows P asa 
function of No/N, for p=2 and n=1, 2, 5, 10, ©, with h chosen in each 
case to make P=.95 when N,/N,=1. Fig. 2 shows P as a function of 
No/N, for p=2, 4, 9 and n= ~, with h again chosen to make P=.95 
when No/N,=1. The curves for finite n were computed from formulas 
given in [4]. The curves for n= «© were computed by numerical integra- 
tion using tables of the normal distribution [9], with certain points for 
p=2 checked against Pearson’s bivariate normal tables [13]. 

Figs. 1 and 2 indicate that the optimum allocation occurs where 
N./N1 is only slightly less than \/p except when the number of degrees 
of freedom is small. Some further computations carried out by the 
author for the case of n= «© degrees of freedom indicated that the point 
of optimum allocation becomes even closer to Vp when h is chosen to 
make P= .99 for No/N,=1. However, for h chosen to make P=.75 for 
No/Ni=1, it was found that P <.75 for No/N;:=~/p and the optimum 
value of No/N, was considerably less than \/p. For practical purposes, 
we can thus conclude that, if the experimenter is working with a joint 
confidence coefficient in the neighborhood of P=.95 or greater, then 
the experiment should be designed so that No/Ni=~/p approximately, 
where No is the number of observations on the control and N, the num- 
ber on each of the p treatments. 


VI. APPLICATION TO BIOLOGICAL ASSAY 


An important example involving the multiple comparison of several 
treatraents with a control arises in the biological assay of several drug 
samples relative to a common standard. In the “parallel line” type of 
biological assay, regression lines Y=ao+bX and Y=a,+bX are fitted 
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Relationship Between Allocation Ratio and 


ons Confidence Coefficient for p=2 and Various n. 








E 
| 








| | 
0.8 1.0 1.2 14 1.6 1.8 20 
Allocation Ratio, N./N, 


946 | | 








Fra. 1.—In plotting these curves, d’s/(1/N:)+(1/No) was fixed for each n 
so that P =.95 for No=Ni, where d’s./(1/N1)+(1/N>) is the difference between 
X,.— 2X, and its lower confidence limit. The dotted curve indicates where the 
optimum occurs for each n; the vertical dashed line is drawn at No/Ni1 = «/p = V2. 





to data representing observed responses Y at several log-dose levels X 
of a standard drug preparation S and a test sample U. The estimated 
log-potency of U relative to S is represented by the horizontal distance 
between the two regression lines, 

a — ao VY; = Yy 


Tai ~ Serger (14) 
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Relationship Between Allocation Ratio and Confidence 
Coefficient for n=O° and Various p 





972 
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964 


Confidence Coefficient, P 


952 











| | | 
1.0 1.5 2.0 he 3.0 a3 4.0 
Allocation Ratio, No/N; 
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Fia. 2.—In plotting these curves, d’4/(1/N,)+(1/No) was fixed for each p 
so that P =.95 for Ne=N,, where d’o./(1/N;)+(1/No) is the difference between 
¥,—Xo and its lower confidence limit. The black arrows indicate where the 
optima occur; the white arrows indicate the points where No/Ni=4/p. 





where Xo and X; are the mean log-dose levels, and Yp and Y, are the 
mean observed responses, of S and U respectively. On the other hand, 
the true log-potency may be represented by 

mm — Mo 





where mo, m; and are the expected values of Yo, Y; and b, respectively. 
Assuming that the responses Y are independent and normally dis- 
tributed with common variance o? and means m+ ((X—Xo) and 
m,+8(X—X:1) for S and U respectively, we can obtain confidence 
limits for M by Fieller’s [6] method by considering 
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Yi — Yo — (M — Xo + Xb 
z= (16) 


1 1 Ban. Satis ae 1/2 
art 4 OO ~ 8.4 Tae 
la-tact o+ yet] 





Here, No and N, are the numbers of responses observed on S and U 
re: pectively, and c?c? is the variance of the common slope b. Then z is 
normally distributed with mean 0 and variance o?. If s? is an estimate 
of o? independent of z, based on n degrees of freedom, then z/s has 
Student’s ¢-distribution with n degrees of freedom. Hence, confidence 
limits for M with confidence coefficient P can be obtained by equating 
z?/s* to t*, where t is the P-percentage point of Student’s ¢, and solving 
the resulting quadratic equation for M. The solution may be found, for 
example, in Bliss [3]. 

Now suppose there are p unknown drug samples: l;, U; - - - , U;, to 
be compared with a common standard, S. Extending the above nota- 
tion in an obvious way, we define the p variables, 


Y; — Yo — (M; — Xo + Xb 


wi rs 1 na nh a 1/2 
~~ M; — Xo + X,)%e? 
lw. tw t ° o+ | 





2 


»t=1,2,---,p (17) 


The z; have a joint p-variate normal distribution with means 0, 
common variance go? and correlation between z; and z; given by 


N No 
pi; = (1 +0) / (> + i.¢ “(> it +6) 


where ¢;= M;— Xo+X,)e\/N>. For the p pairs of confidence limits on 
the M; to have a joint confidence coefficient equal to P, we are led to 
consider the joint distribution of the z;/s, which is the multivariate 
Student ¢-distribution as in Section III, with this important difference: 
the correlations p;; are no longer known exactly since they involve the 
unknown parameters M; and M;. Fortunately, ¢; may be expected to be 
fairly smal! in most cases since it is common practice in designing 4 
biological assay experiment to try to arrange the dose levels so that 
Xo—X; is close to M;. Thus, we can obtain confidence limits for the M; 
which have approximately the desired confidence coefficient by assum- 
ing the e; to be negligible, whence 


[VENER a 
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Contour Curves for Correlation Coefficient 
in Biological Assay Case 
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1.0- 
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Fia. 3 


This takes the value } in the usual situation where No=N; (t=1, 
2, - ++, p), in which case the tables in this paper are applicable. 

The contour curves in Fig. 3 show the effect of ¢; and ¢; on p;; when 
No=N,;=N;. In most practical situations, the log-dose levels chosen by 
the experimenter for each drug will cover a wide enough range so that 
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Effect of Correlation Coefficient on 


958 Confidence Coefficient 
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the ¢; are numerically small, in which case they do not exert much 
influence on the value of p;;. In Fig. 4, curves are drawn for degrees of 
freedom equal to 5 and showing the actual P attained as a function 
of p in the case of estimating the log-potencies of two drugs (p=2) 
when N»=Ni=Nz and the tables are used applicable to p=} and 
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P=95%. It may be seen that for a wide range of values of p around 
-=}, P does not differ much from the value 95%. 


VII. MULTIPLE DECISION PROCEDURES FOR COMPARING 
TREATMENTS WITH A CONTROL 


A multiple decision procedure for selecting the “best” of p+1 cate- 
gories when comparing p experimental categories with a control has 
been developed by Paulson [11]. Paulson gives a method for making 
one of the following p+1 decisions: 


Dy: select the control as best 
D;: select the i-th category as best (t=1,2,---, p) 


If Xo, X1, - - - , Aprepresent the p+1 means, each based on N observa- 
tions, Xo being the control, then Paulson’s method consists in picking 
out X*=max (Xi, ---, Xp) and if X¥*— Xo=d,8v/2/N the decision D* 
corresponding to X* is made, whereas if X*—Xo<),8\/2/N the de- 
cision Do is made. The constant , is chosen so that the probability of 
making decision Do, when all p treatments are equivalent to the control, 
is equal to a pre-assigned value P. Clearly, d;’=, must be a solution 
of (4). Thus, Tables 1a and 1b of this paper give the required values of 
\, for P=.95 and .99. 

In some situations it may be appropriate to consider the following set 
of possible decisions from which a choice is to be made: 


{ Do: select the control as best 

;; select the i-th treatment (i=1, 2, - - - , p) as the only one better 
than the control 
select the i-th and j-th treatments (i, j=1, 2,---, p; 7¥)), 
without ordering them, as the only two better than the control 


D,2,...,p: select all p treatments, without ordering them, as better 
than the control 





\ 


The following procedure is proposed for choosing one of the above 2 
decisions on the basis of the p+1 observed means Xo, X1, - - -, Xp, each 
based on N observations, and the independent estimate s of the com- 
mon standard deviation: 


(Accept Do if X:—_Xo<ds\/2/N for all 7 

Accept D, if X¥;—Xo2ds\/2/N and X;—Xo<ds\/2/N for all j¥i 

Accept Di if X:—Xozdsv/2/N, X;—Xozdsy 2/N and Xa— Xo 
<ds\/2/N for all ki or j 





(Accept Di 2,...,» if X:— Xo2dsv/2/N for all 7 
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By choosing d to satisfy (4), we will be assured that the probability of 
accepting Dy when all the treatments are equivalent to the control jg 
equal to a pre-assigned value P. In fact, d will then be identical with 
Paulson’s \,. Table 1 gives the required values of d for P=.95 and .99, 

It is also possible to determine the size N of sample required to 
achieve a specified probability of accepting some other decision when 
all the treatments are not equivalent to the control. For example, 
suppose all the treatments are equivalent to the control except one, 
say the first one, which is better than the control. The correct decision 
to make would then be D;. Suppose mp =m2=m3= + - - =mM,=m—5. 
Then the probability of making decision D, can be written 

P, = Pr (D,| mo = m = +++ = my = m — 8) 
= Pr (Xi =e Xo = dsx/2/N, X: = Xo 
< ds\/2/N, +--+, Xp — Xo < dsvV/2/N) 
= Pr(t; 2d, <d,--+-,t, < d) 


in X; — Xo 
3/2/N 


(@@ = 1,2,+--+, p) 


5 = 


We note that #, - - - , ¢, are Student ¢-variates, but 4, is a non-central 
t-variate. 
To obtain bounds for P;, write 


P, = Pr (i <d,<d,---,t,<d) (20) 
= Pr (t, < d)-Pr (2 < d,---,t, < d) 
This inequality follows from Dunnett and Sobel [5]. Since 
P, + Pi = Pr(tzp <d,---,t,<d), (21) 
it follows that an upper bound for P, is given by 
P, <= Pr( 2 d)-Pr(a<d,---,t,<d). (22) 
On the other hand, an obvious upper bound on P, is 
P, s Pr (th < d) (23) 
From (21) and (23), we get the following lower bound for P,, 
P, 2 Pr(2<d,---,t,<d) —Pr(t <d). (24) 


The difference between the bounds given by (22) and (24) is 
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Bounds on P, for n=©O and p=2,9 
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Pr (4, <d) -[1—Pr(&<d, - - - , tp<d)] which will be small in the region of 
interest where P, is fairly large. In Fig. 5, the two bounds for P, are 
plotted as functions of »/(N/2)6/o for d.f.= ©, p=2 and 9, and where 
d is chosen in each case so that P=.95. To compute the probability 
involving tz, - - - , tp, in (22) and (24), tables of the multivariate nor- 
mal distribution computed by the Bureau of Standards [10] were used. 
If it were desired to compute the bounds on P; for a finite number of 
degrees of freedom, the probability involving f2, - - - , t, could be com- 
puted by numerical integration on the Bureau of Standards tables, 
However, since in most practical situations the number of degrees of 
freedom will be large, it may be sufficient to assume d.f.= © and use 
the Bureau of Standards tables directly. 

Using Fig. 5, it is possible to determine the required value of N cor- 
responding to any given values of 5/o and P;. The following table shows 
the value of \/V5/o corresponding to p=1(1)9 to achieve P; =.80 for 
P=.95 and d.f.= 0. 








VNi/o 


3 





.52 
-05 
30 
-46 
.58 
.67 
.74 
81 
86 





1 3 
2 4 
3 4 
4 4 
5 4 
6 4 
7 4 
8 4 
9 4 





For example, suppose that the experimenter has p=5 treatments to 
compare with a control. Table 1a will provide the value of d to give a 
probability of P=.95 that none of the treatments will be declared su- 
perior to the control when, in fact, all are equivalent to the control. If, 
in addition, the experimenter wants to achieve a probability of 
P,=.80 of correctly selecting a superior treatment, if there is one which 
is, say, one standard deviation better than the control, the above table 
shows that »/V5/o=4.58 whence N =21 on substituting 5/o=1. Thus 
21 observations should be taken on each of the treatments and on the 
control. This will provide (p+1)(N—1)=120 degrees of freedom for 
estimating the variance, so that d=2.26 from table la. While the table 
used above to determine N is based on d.f. = ©, the result should not be 
much different for d.f. = 120. The experimenter would then take 21 ob- 
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servations in each group, observe Xo and X; (¢=1, 2, ---, 5), caleu- 
late s? from equation (1), and declare any treatment superior to the 
control which gives a mean X; greater than Xo+2.26s\/2/21. 


TABLE 1a* 


TABLE OF ¢ FOR ONE-SIDED COMPARISONS BETWEEN p TREAT- 
MENT MEANS AND A CONTROL FOR A JOINT 
CONFIDENCE COEFFICIENT OF P=95% 








p, NuMBER Or TreatMENT Means (Exciupina THE ContrRot) 





1 2 4 6 7 8 9 





.02 
.94 
.89 
86 
.83 


44 : 85 
34 : 71 
.27 . 2.62 
.22 : -55 
18 . 50 


.08 16 
.92 -00 
.82 .89 
.74 81 
.68 75 


3.30 
.12 
01 
.92 
86 


bt tS bh b& be 
tN bh} b&b bo 
Nw hd bd & 
Nw dw & Ww 
No hw & & 


81 
.80 
78 
77 
.76 


15 : 47 
13 : 44 
ll ‘ 41 
-09 : .39 
.08 ‘ 37 


.70 
.67 
64 
61 
.59 


81 
77 
74 
71 
69 


.53 
.50 
48 
-46 


to hS &S bw bw 
Nt hw b& bt 
Le SS) 
Nw hw hd bd 
tS wo WwW 


75 
75 
74 
.73 
.73 


.07 . .36 
.06 : 34 
.05 . 33 
-04 ‘ .32 
.03 ‘ 31 


44 
43 
.42 
41 
.40 


.57 
56 
54 
.53 
.52 


.67 
65 
64 
.62 
-61 


ww hw hd 
Nw WwW bt 
to bw dS db bt 
Nw hw WY bd bw 
Nw NH W 


72 
71 
.70 
.68 
.67 


.03 ‘ .30 
01 ‘ .28 
-99 ; 25 
97 , .23 
95 ‘ 21 


.39 
36 
.33 
31 
.28 


-51 
48 
45 
37 .42 
35 .39 


-60 
.57 
.54 
51 
48 


— ee = DO DO 
bo Nw Nw Wb be 
Nd bh hw b& 
bo Nw hw db be 
ww Nw ND 


44 


120 -66 
inf. -64 


—y 


-93 ‘ 2.18 -26 | 2.32 | 2.37 41 


2 2 45 
92 . .16 | 2.23 | 2.29 | 2.34 .38 


42 





























_ 
Nb 





* Table la gives a solution d;’ =t¢ to equation (4) in the text for P =.95 for the case p¢j = 1/2. 
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TABLE 1b* 


TABLE OF t FOR ONE-SIDED COMPARISONS BETWEEN p TREAT. 
MENT MEANS AND A CONTROL FOR A JOINT 


CONFIDENCE COEFFICIENT OF P=99% 








p, NuMBER Or TREATMENT Means (Exciupine THe Contro.) 








df. 1 | 2 | 3 | eis ta 718s i| 
5 3.37 | 3.90 | 4.21 | 4.43 | 4.60 | 4.73 | 4.85 | 4.94 
6 3.14 | 3.61 | 3.88 | 4.07 | 4.21 | 4.33 | 4.43 | 4.51 
7 3.00 | 3.42 | 3.66 | 3.83 | 3.96 | 4.07 | 4.15 | 4.23 
8 2.90 | 3.29 | 3.51 | 3.67 | 3.79 | 3.88 | 3.96 | 4.03 
9 2.82 | 3.19 | 3.40 | 3.55 | 3.66 | 3.75 | 3.82 | 3.89 
10 2.76 | 3.11 | 3.31 | 3.45 | 3.56 | 3.64 | 3.71 | 3.78 
11 2.72 | 3.06 | 3.25 | 3.38 | 3.48 | 3.56 | 3.63 | 3.69 
12 2.68 | 3.01 | 3.19 | 3.32 | 3.42 | 3.50 | 3.56 | 3.62 
13 2.65 | 2.97 | 3.15 | 3.27 | 3.37 | 3.44 | 3.51 | 3.56 
14 2.62 | 2.94 | 3.11 | 3.23 | 3.32 | 3.40 | 3.46 | 3.51 
15 2.60 | 2.91 | 3.08 | 3.20 | 3.29 | 3.36 | 3.42 | 3.47 
16 2.58 | 2.88 | 3.05 | 3.17 | 3.26 | 3.33 | 3.39 | 3.44 
17 2.57 | 2.86 | 3.03 | 3.14 | 3.23 | 3.30 | 3.36 | 3.41 
18 2.55 | 2.84 | 3.01 | 3.12 | 3.21 | 3.27 | 3.33 | 3.38 
19 2.54 | 2.83 | 2.99 | 3.10 | 3.18 | 3.25 | 3.31 | 3.36 
20 2.53 | 2.81 | 2.97 | 3.08 | 3.17 | 3.23 | 3.29 | 3.34 
24 2.49 | 2.77 | 2.92 | 3.03 | 3.11 | 3.17 | 3.22 | 3.27 
30 2.46 | 2.72 | 2.87 | 2.97 | 3.05 | 3.11 | 3.16 | 3.21 
40 2.42 | 2.68 | 2.82 | 2.92 | 2.99 | 3.05 | 3.10 | 3.14 
60 2.39 | 2.64 | 2.78 | 2.87 | 2.94 | 3.00 | 3.04 | 3.08 
120 2.36 | 2.60 | 2.73 | 2.82 | 2.89 | 2.94 | 2.99 | 3.03 
inf. 2.33 | 2.56 | 2.68 | 2.77 | 2.84 | 2.89 | 2.93 | 2.97 
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* Table 1b gives a solution d;’ =¢ to equation (4) in the text for P =.99 for the case ps; =1/2. 
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TABLE 2a* 


1119 


TABLE OF t FOR TWO-SIDED COMPARISONS BETWEEN p TREAT- 
MENT MEANS AND A CONTROL FOR A JOINT 


CONFIDENCE COEFFICIENT OF P=95% 











p, NumBer Or TREATMENT Means (Exciupine THE ContTROL) 



































d.f. 1 2 | 3 4 5 6 7 8 9 
5 2.57 | 3.03 | 3.39 | 3.66 | 3.88 | 4.06 | 4.22 | 4.36 | 4.49 
6 2.45 | 2.86 | 3.18 | 3.41 | 3.60 | 3.75 | 3.88 | 4.00 | 4.11 
7 2.36 | 2.75 | 3.04 | 3.24 | 3.41 | 3.54 | 3.66 | 3.76 | 3.86 
8 2.31 | 2.67 | 2.94 | 3.13 | 3.28 | 3.40 | 3.51 | 3.60 | 3.68 
9 2.26 | 2.61 | 2.86 | 3.04 | 3.18 | 3.29 | 3.39 | 3.48 | 3.55 

10 2.23 | 2.57 | 2.81 | 2.97 | 3.11 | 3.21 | 3.31 | 3.39 | 3.46 

11 2.20 | 2.53 | 2.76 | 2.92 | 3.05 | 3.15 | 3.24 | 3.31 | 3.38 

12 2.18 | 2.50 | 2.72 | 2.88 | 3.00 | 3.10 | 3.18 | 3.25 | 3.32 

13 2.16 | 2.48 | 2.69 | 2.84 | 2.96 | 3.06 | 3.14 | 3.21 | 3.27 

14 2.14 | 2.46 | 2.67 | 2.81 | 2.93 | 3.02 | 3.10 | 3.17 | 3.23 

15 2.13 | 2.44 | 2.64 | 2.79 | 2.90 | 2.99 | 3.07 | 3.13 | 3.19 

16 2.12 | 2.42 | 2.63 | 2.77 | 2.88 | 2.96 | 3.64 | 3.10 | 3.16 

17 2.11 | 2.41 | 2.61 | 2.75 | 2.85 | 2.94 | 3.01 | 3.08 | 3.13 

18 2.10 | 2.40 | 2.59 | 2.73 | 2.84 | 2.92 | 2.99 | 3.05 | 3.11 

19 2.09 | 2.39 | 2.58 | 2.72 | 2.82 | 2.90 | 2.97 | 3.04 | 3.09 

20 2.09 | 2.38 | 2.57 | 2.70 | 2.81 | 2.89 | 2.96 | 3.02 | 3.07 

24 2.06 | 2.35 | 2.53 | 2.66 | 2.76 | 2.84 | 2.91 | 2.96 | 3.01 

30 2.04 | 2.32 | 2.50 | 2.62 | 2.72 | 2.79 | 2.86 | 2.91 | 2.96 

40 2.02 | 2.29 | 2.47 | 2.58 | 2.67 | 2.75 | 2.81 | 2.86 | 2.90 

60 2.60 | 2.27 | 2.43 | 2.55 | 2.63 | 2.70 | 2.76 | 2.81 | 2.85 

120 1.98 | 2.24 | 2.40 | 2.51 | 2.59 | 2.66 | 2.71 | 2.76 | 2.80 

inf, 1.96 | 2.21 | 2.37 | 2.47 | 2.55 | 2.62 | 2.67 | 2.71 | 2.75 





* Table 2a gives a solution dg” =t which makes the right-hand side of inequality (11) in the text 
equal to .95 for the case p =1/2. This may be used as an approximate solution to equation (5) in the 


text for P =.95 for the case pjj = 1/2. 
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TABLE 2b* 


TABLE OF t FOR TWO-SIDED COMPARISONS BETWEEN p TREAT. 
MENT MEANS AND A CONTROL FOR A JOINT 
CONFIDENCE COEFFICIENT OF P=99% 








p, NumBeR Or Treatment Means (Exciupine Tue Controz) 





d.f. 1 2 3 4 5 
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* Table 2b gives a solution d;”’ =t which makes the right-hand side of inequality (11) in the text 
equal to .99 for the case p=1/2. This may be used as an approximate solution to equation (5) in the 
text for P =.99 for the case pyj = 1/2. 
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MAXIMUM LIKELIHOOD ESTIMATION OF THE Dis. 

PERSION PARAMETER OF A CHI-DISTRIBUTED RADIAL 

ERROR FROM TRUNCATED AND CENSORED SAMPLES 
WITH APPLICATIONS TO TARGET ANALYSIS* 


A. CiirrorD CoHEN, JR. 
The University of Georgia 


The radial error, r =(2:2-+22?+ +--+ +2,*)2, where com- 
ponents z,(j=1, 2,--- p) constitute a set of p independent 
random variables, each of which is normal (0, c), has the fre- 
quency function [2r/e*]f,(r?/o%) in which fp(x?) is the x? fre 
quency function with p degrees of freedom. This distribution 
is of interest in a class of target analysis problems where the 
origin or “center of impact” is known or can be assumed as 
known. For p=2 and p=3, ¢ is important as a measure of 
error dispersion and hence as a measure of weapon-system 
accuracy. Here, we are concerned with estimating o from 
truncated and censored samples as well as from complete 
samples. Maximum likelihood estimating equations are ob- 
tained in these cases, and where explicit estimators are not 
possible, the equations are reduced to forms which permit 
solution by interpolation with the aid of normal curve tables of 
areas and ordinates. To facilitate solution in the cases of 
truncated samples for p =2 and p =3, tables and graphs of the 
estimating functions are given. An explicit estimator is ob- 
tained in the case of a censored sample when p =2. Asymptotic 
variances of the estimates are obtained for each of the cases 
considered. Illustrative examples relating to target analysis 
studies are included. For use when the origin is not known, a 
graphical procedure for estimating its coordinates based on 
use of an “estimating circle” of fixed radius, located to enclose 
a maximum number of sample points or “hits” is described 
for the cases in which p =2. 


I, INTRODUCTION 


HIS paper deals specifically with the distribution of a generalized 
n-dimensional radial error defined as 


r= Jae + ai +--+ + 2,%, (1) 





where components x; (j=1, 2, - +--+ p) are independent random vari- 
ables, each of which is normally distributed about a known mean with 
standard deviation oc. It is well known (c.f., for example, Cramér [3], 
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p. 233) that (r/c)* has a chi-square distribution with p degrees of free- 
dom; that is, 


Q-rl2 ff p\2@/2-1) l/r? 
2/52) = nod nal ih 
Solei/e") I'(p/2) ( = exp| 2 Gi | @) 


It follows that (r/c) has a chi distribution, and for the frequency func- 
tion of r, which is of primary interest here, we have 


Q-(v-2)/2 / p\PH1 L/r\? 
oe cmc aie > aml one a 
Srl) ac) exp | 5 (=) | lites @) 


With reference to f,(r), o is to be regarded as a parameter of scale or of 
dispersion. The frequency function f,(r) is a generalization of the two 
dimensional frequency function 


far) = (r/o?) exp [—(r/0)?/2], (4) 


of radial errors on a plane target in which errors are distances from 
the target center (or a known center of impact) to points where missiles 
strike the target. 

The dispersion parameter, a, of (4) is of interest in numerous target 
analysis problems as a measure of weapon-system accuracy, and it is 
often of practical importance to estimate it, not only from complete 
samples, but from samples that may be truncated or censored. To be 
complete, all sample values of r must be measured. When rounds miss 
a target, and their errors thus cannot be measured, truncated or cen- 
sored samples result. A truncated sample is one in which all r exceeding 
a known terminal, r., are omitted, and the number of such omissions is 
unknown, the effect being the same as if all errors for which r>r, were 
eliminated from the population. A censored sample is one in which the 
number of errors exceeding r, is known, but such errors are not other- 
wise measured. In both truncated and censored samples, actual meas- 
urements are recorded for all rSro. 

The work reported on here is related to previous investigations of 
truncated samples from Type III distributions by the writer [2] and 
by Des Raj [4]. It is also related to the estimation of parameters of 
univariate normal populations, a subject which has been considered by 
numerous writers (c.f., [1] and other references listed therein). 


II. COMPLETE SAMPLES 


Consider random selections from a population distributed according 
to (3) when observation of r is unrestricted so that any value within the 
possible range 0SrS @ can be observed and measured. A sample con- 





1124 AMERICAN STATISTICAL ASSOCIATION JOURNAL, DECEMBER 1055 


sisting of n such observations is said to be complete (or unrestricted) of 
size n, and its likelihood function is given by 


P = 2-"(0-2)/2g-p"[T(p/2) | Ire] exp | - > r2/2e*|. (5) 


Taking logarithms, differentiating and equating to zero, we have 
aL 


1 n 
£4. oR ectwes 0 
0a o o* ‘5 


where L =log P. The maximum likelihood estimator of o follows as 


$= y/ > r2/pn, (7) 


where the symbol (~) serves to distinguish the estimator from the 
parameter estimated, a convention observed throughout this paper. 


Ill. TRUNCATED SAMPLES—-NUMBER OF ELIMINATED 
OBSERVATIONS UNKNOWN 


The likelihood function of a sample consisting of n random observa- 
tions from a population distributed according to (3) when each observa- 
tion is subject to the restriction 0SrSr, is 


P= 2ee-nire-oe[r(p/2)}-*| Tre] [1 = Tal 
(8) 


“exp |- Dra/2et], 


where 


Q-(—-2)/2 wo r\?-1 l/r dr 
falrete) = TOT In (=) ed [- 76) }E- 


If we let 
t= r/o and thus & = r/o, 


equation (9) reduces to 
Iylroto) = Iy(&.) = f o(8)de, where 
§o 
Q—(p—2)/2 


6,(€) = T(p/2) -! exp (—§£?/2). 
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Taking logarithms of (8), differentiating and equating to zero, we have 
al 1s 
Co 


Differentiating (11), we obtain 


OI,(&) 
ae re Tee ( fs), 


and with this result, the preceding equation reduces to 


_ Mo 6p(£o) 
o¢ 1 — 1,(&) 
Equation (13) further reduces to 


» r?/nro = G,p(k), 


1 o( £o 
Gp(&o) = (2 oe et.) |. 


& L& 1 — Ip(&) (15) 


Equation (14) can be solved for ¢, using standard iterative procedures 
or by interpolation in appropriate tables. From (10) it follows that 


6 = re/b. (16) 


Truncated Sample From Population of Dimension Two. When p=2, 
62( £0) ” & exp —&,?/2 _ V Qrtob(Eo), and I3(&) = V 2x(Eo), where (t) 
=(1/+/2r) exp—??/2, the standardized normal curve ordinate. For 
this case, estimating equation (14) becomes 


ure 4G) 


1 2 P 
== ») 17 

b=. oo 
For any given ¢, G:(¢) can be evaluated from an ordinary table of nor- 
mal curve ordinates, and simple interpolation can be used to solve (17) 
for ¢.. In view of ‘the particular interest attached to this case, an 
abbreviated tabulation of G.() is included in Table 1, and a graph of 
this fanction is given in Fig. 1. With dor"r2/nr.? computed from the 
sample, ¢, can be read directly from the graph with sufficient accuracy 
for many purposes. More precise values can be obtained by interpola- 
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TABLE 1 
THE FUNCTIONS G,(¢) AND G,(é) 


$(E) _3 2t4() 
* 2 1—[280(@) +) 


G2(€) G,(E) 


-34348 -45758 
. 32960 - 44330 
.31562 -42857 
.30164 -41349 
. 28775 .39814 


2 
G8) = GO ob) 











ote 


G,(€) G;(€) 


err 





-50000 -60000 
-49958 -59965 
-49833 .59863 
-49625 -59691 
-49333 -59450 


Romeo 
mynwnny 
mote © 


-48959 .59138 
-48501 .58756 
-47960 -58303 
-47338 .57777 
- 46634 .57178 


. 27404 .38261 
- 26061 -36702 
. 24753 -35145 
- 23486 - 33602 
. 22267 - 32082 


DONA 
mwnwnty 
DWUAA 


-45851 -56505 
-44989 .55757 
-44051 .54935 
-43041 -54037 
-41961 .53064 


-21099 . 30594 
. 19986 . 29146 
- 18930 .27745 
. 17932 - 26397 
- 16991 - 25106 


Pwnmes 
Cw ew & & 
RPwneo 


-40817 -52018 
.39614 -50899 
.38358 .49710 
.37056 -48453 
.35717 -47134 


- 16107 23875 
- 15278 . 22705 
- 14503 -21598 
.13777 - 20553 
. 13099 . 19569 


5 
6 
7 
8 
9 


Ww & & Ww 
Conan 


. 12466 - 18643 
-09873 . 14800 
-08000 - 11999 


a, > 
ono 





tion in Table 1, and when still greater accuracy is demanded, additional 
values of G2(¢) can be computed from the normal curve ordinates for a 
finer interval of interpolation. Once ¢ has been determined with the 
necessary accuracy, ¢ follows from (16). 

Truncated Sample From Population of Dimension Three. Radial 
errors of dimension three are of interest in certain weapon-system ac- 
curacy studies involving three dimensional targets. The distribution of 
r in this case is also of interest in studies of molecule velocity with 
respect to a three dimensional system of rectangular coordinates in 
which r is interpreted as velocity rather than as target error. In view of 
this practical interest, more explicit estimators are given. 
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With p=3, 63(&) =2&.26(t), Is(t.) =[2&p(&) +1i(&)], and equation 
(14) becomes 


nm 
Dr? 
1 


NP yo? 


Eo) 
1- [2t.6(E.) + I,(&.) | 


We note that 1,(£.) =2/¢,°¢(#)dt, which is double the area under the 
normal curve to the right of § =&. The manner of solving (18) for ‘ is 
essentially the same as in the two dimensional case. To facilitate this 
solution, G;(¢) is also included in Table 1 and its graph is given in 
Fig. 1 along with that of G.(é). 





3 2 
“"" = G;(&.). (18) 


IV. CENSORED SAMPLES—NUMBER OF UNMEASURED 
OBSERVATIONS KNOWN 


If we make random observations on a population distributed ac- 
cording to (3), record measurements of each error for which OSrSr., 
while counting but not otherwise measuring those for which r>r, until 





1128 AMERICAN STATISTICAL ASSOCIATION JOURNAL, DECEMBER 1955 


m measured observations have been accumulated, the sample obtained 
is said to be censored. Let n, designate the number of censored observa- 
tions for which r>r,, and the likelihood function for a sample of this 
type may be written as 


P= kQ—"(@—2)/2g—pn[T'(p/2) }- | Il re] Ip" 
(19) 


‘exp |- Dre/2e], 


where k is a constant and J,(¢,) is given by (11). Taking logarithms of 
(19), differentiating and equating to zero, we obtain 


OL No al, (20) 


Using (12), this result reduces to 


ur E ry p(B) 


a 


& L & n I,(&) 

If instead of fixing n, the number of measured observations, in ad- 
vance of selecting a sample, we fix the total number of observations, 
n+p, the same likelihood function (19) and the same estimating equa- 
tion (21) are obtained. In this case, the estimating equation would not 
have a solution when n=0, but this limitation is of little practical 
importance. The solution of (21) for ¢, in the general p-dimensional 
case is essentially the same as for (14) in the truncated case. 

Samples considered here are of the fixed terminal type; that is, r, is 
fixed prior to sampling. In some situations, fixed size (variable termi- 
nal) censored samples may arise in which the n, (fixed) largest errors 
out of a fixed total number of observations are unmeasured. Estimating 
equations for samples of this latter type are, however, identical with 
those given here for corresponding fixed-terminal samples if r, is re- 
placed with r,, the largest of the measured sample observations. 

Censored Sample From Population of Dimension Two. When p=2, 
62(&)/I2(€) =, and (21) becomes 


| = H,(é,). (21) 
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Using (10), this reduces to 


which, in this case, is a simple explicit estimator of the required 
parameter. 

Censored Sample From Population of Dimension Three. When p=3, 
65( £0) /Ia(to) = &o?/[éo+11(£.)/2(éo)], where as previously indicated I,(&) 
is twice the area under the normal curve to the right of §=£ and 
¢(é) is the normal curve ordinate. In this case, (21) becomes 

2 
2X a 3 Neko 








——— = ——= ° (24) 
nr? sé? ml & + Ti(&)/24(E,) ] 

With the aid of tables of normal curve areas and ordinates, this equa- 
tion can be solved for ¢ by interpolation as already indicated for the 
general case of (21). For a first approximation, the value of £ read from 
the G, curve of Fig. 1 should suffice. 


V. RELIABILITY OF ESTIMATES 


Complete Samples. For complete samples, (pné?/o?) has a chi-square 
distribution with pn degrees of freedom. To prove this, we note that 
each component z; of r is normal (0, «) and thus z;/o is normal (0, 1). 
Let x;; designate the ith observation of z;, and }>;" (2;;/c)?, which is 
(x;/o0)? summed over all sample observations, has a chi-square distri- 
bution with d.f.=n. Since the sum of independent chi-square dis- 
tributions is itself a chi-square distribution with degrees of freedom 
equal to the sum of the respective degrees of freedom of the component 
distributions, and since Dd j-1” Din” (x;:/0)* = (pné*/o?), the stated re- 
sul’ follows. 

Confidence intervals for o can be determined as 
/pnée? pné? 
\ =l-—a, 


P} c¢e< 
V x2? x1? 


(25) 


where ¢ is given by (7) and x? and x2” are read from standard chi- 
square tables with d.f.=pn such that P[x?> x.2]=1l—a/2, and 
P[x?> x2?] =a/2. 

Siuce (pné?/o?) has a chi-square distribution with d.f.=pn, then 
(\/pn/c)é has a chi distribution with the same degrees of freedom. 
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Kendall [5], p. 294 gives the moments of the chi distribution, and using 
his results, the exact variance of ¢ is 


- m+1\ )27 
(7) 
a? 2 


V(¢) = —] 2pn — ; (26) 


2pr 
a "(3) 
a 2 a 


An expansion based on an extended form of Stirling’s formula, also 
given by Kendall permits (26) to be written as 











v= f1-+--] 27 
2pm 4pn (27) 


For the mean, Kendall’s results give 


1 1 
— i es eae en 3 
mz o| oa + 32(pn)? + | (28) 


For large values of n, confidence intervals for ¢ can be approximated 
from the normal distribution with mean and variance as given by (27) 
and (28). 

Similar exact sampling results are not available for restricted sam- 
ples, but asymptotic variances for each of the maximum likelihood 
estimates, including that based on a complete sample, are given by 


Lh —1 
Asy. Var. (¢) = — E )| ‘ (29) 


a0? 


For restricted samples, this leads to specific variances as given below. 
Truncated Samples. Although the result in this case is not claimed 
to be obvious, straightforward manipulation gives 


(—) soon £0 p(£o) 
E(—) = - 1- 
0c? a 2p[l is I(t) | 


£0p(6) ] 
det nto t ’ 
E ae SE } 


Using tiis result, the asymptotic variance in the two-dimensional case 
is given by (29) as 


” a ” fo" $(0)(é) “ 
an Yer. = ot Tey 





(30) 
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Censored Samples. Again in this case, after some straightforward 
though involved manipulation, we find 


E (—) 2pE(n) { f.p(to) 
pion, PE en 
do? o? 2p[1 — I,(é)] 


£Op(Eo) 
| &? — (p — 2) — 
[ ities eg 


Using this result, the corresponding asymptotic variance for a two- 
dimensional population is given by (29) as 


Asy. Var. (¢) = o?/4E(n). (33) 


Throughout this paper, n designates the number of measured ob- 
servations in each sample considered, and E(n) which appears in equa- 
tions (30-33) is the expected value of n. With E(n) appropriately 
evaluated, the same variance formulas are applicable regardless of the 
sampling scheme employed. Values of E(n) for three sampling schemes 
that seem most likely to occur are given below. 

(i) The experiment is continued until n measured observations are 
obtained in the range 0<r<r,. Since n is a fixed number in this case, 
E(n) =n. 

(ii) A fixed total of say, N, (=n-+n,) observations are selected, 
but only those for which OSrSr, are measured. The number of 
measured observations, , in this case is a random variable and 
E(n) = N[(1—I,(&)]. 

(iii) The n smallest observations out of a total of N are measured. 
In this case both n and N are fixed numbers, but r, is a random variable. 
As already mentioned the estimators in this case are the same as those 
given for the censored samples of Section IV, provided r, is replaced by 
rn, Where r, is the largest of the n measured observations in a given 
sample. With n fixed, E(n) =n as in case (i) above. 





VI. ILLUSTRATIVE EXAMPLES 


To illustrate the practical application of results obtained in this 
paper, we employ a sample selected from a population distributed 
according to (3) with p=2 and o=10. The actual selection was made 
using a table of random numbers, and although certain approximations 
were involved in obtaining observations, the sample appears to be 
adequate for the intended purpose. By appropriately adding informa- 
tion, the same basic data of the truncated sample also serves to illus- 
trate estimation from censored and complete samples. 





1132 AMERICAN STATISTICAL ASSOCIATION JOURNAL, DECEMBER 10955 


Truncated Sample. In this case, we let r,=26, and the sample con- 
sists of n=25 observations, each of which is less than or equal to r,. 
>:* r2=3561, and }oi® r2/nr.2=3561/(25)(26)?=0.21071. Inter- 
polating from G,(¢) in Table 1, we have ¢=3.003, a result which might 
be read with only slightly less accuracy from the corresponding graph 
of Fig. 1. From (16), we have ¢=26/3.003 =8.66. Using this value for 
a, which for illustrative purposes, is assumed to be unknown, we obtain 
V (é) from (31) as 1.27481 and of =~ V(é) =1.129. 

Censored Sample. To the truncated sample, we add information of 
the occurrence of n,=2 observations for which r>r,. Otherwise, the 
sample remains unchanged. The required estimate is computed from 
(23) as é=+/[3561+ 2(26)?]/2(25) =9.91. Using this value for o which 
is still assumed to be unknown (for purposes of this illustration), we 
approximate V(é) from (33) as 9.912/4(25) =0.9826, and o7=0.991. 

Complete Sample. To the censored sample, actual measurements of 
the two censored observations were added to produce a complete 
sample with n=27, and for the sample thus formed, )>,”” r?=5019. 
From (7) with p=2, we compute ¢=+/5019/2(27) =9.64, a value 
which differs by only a small amount from the censored sample esti- 
mate of the previous paragraph. The variance of this estimate is com- 
puted from (27) as V(é) =0.8520, and ¢>=0.923. 





The above results are summarized in the following table along with 
0.95 confidence limits based on the variances as computed and the 
normal curve approximation for the distribution of @ in each of the 
cases considered. 








Type of Sample 0.95 Confidence Interval 





Truncated ’ 6.45 <o <10.87 
Censored ‘4 7.97 <0 <11.85 
Complete ‘ 7.79 <oe0<11.41 





VII. SOME FURTHER REMARKS 


The origin from which radial errors are measured is not known in all 
target analyses and its coordinates which are means of the component 
distributions must be estimated from the sample. This situation pre- 
sents no essential difficulties in unrestricted samples since sample means 
are maximum likelihood estimators of population means, but in re- 
stricted samples it introduces some major complications. In the two- 
dimensional case, however, an estimating circle to be located graphi- 
cally can be employed to advantage. Consider the two-dimensional 
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plane of the distribution of z and y. Require that a circie of fixed radius 
be so located that the probability of a pair (2, y) falling in that circle 
is a maximum. The center of such a circle must be located at the point 
(mz, my). Since mz and my, are unknown, we locate our “estimating 
circle” by placing it over the scatter diagram of an observed sample 
and shifting its position until the maximum number of points or 
“hits” is enclosed. The coordinates of the center of this circle are then 
taken as estimates of the means, m, and my. With small samples, we 
may not always find a unique position for a circle of fixed radius and 
it may be necessary to adjust the radius to alleviate this difficulty. 
A circle which encloses from 70-90 per cent of all sample points will 
be satisfactory in ordinary circumstances. 

As an alternate method of determining estimates, we may locate 
the estimating circle as described above and based only on points inside 
this circle, use as estimators 


n nk 
m,** = > x;/N, and m,** = _ y:/Nk, 
1 1 


where nm, is the number of points inside the circle. 

When means are estimated as suggested in this section, the variances 
of Section V are not strictly applicable. If, however, the number of 
points neglected in determining estimates of the means is small, these 
variances should furnish reasonable approximations to the correct 
values. 

An example of hits of a hand thrown dart against a rectangular 
plane target affords an opportunity to illustrate how estimates of this 
section are determined. Out of 20 throws at the target, one resulted 
in a miss. The scatter diagram of the remaining 19 hits is shown to 
scale in Fig. 2, The coordinates of these 19 hits referred to the lower 
left corner of the target as an origin are given below. 














An estimating circle of radius r, was located so that it encloses 15 
points, which is the largest number that can be enclosed without in- 
creasing the radius. Coordinates of the center of this circle (3.5, 5.2) 
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Fic. 2. Scatter diagram of darts on a target. 





are estimates of m, and m, respectively. The terminal radius is now 
the distance from the center of the estimating circle to the nearest 
edge of the target. For our sample, r,=3.50, and we consider any hit 
outside the circle thus defined as a censored observation. If such 
observations are treated otherwise, estimator (23) will not be applica- 
ble. Although this results in some loss of information in certain cases, 
the loss is almost negligible if r, exceeds two standard deviations. In 
the present illustration, only the single throw which missed the target 
entirely gave a point outside this circle. Accordingly, we now have a 
censored sample for which n= 19, n,=1, and r,=3.5. Using the center 
of the estimating circle as the origin, we compute > r;? =80.91 for 
the 19 observed hits, and from (23) we estimate ¢ as 





o** = 4/[80.91 + 1(3.5)?]/2(19) = 1.57. 


The designation (**) serves to recognize that the estimate is not to 
be considered as a maximum likelihood estimate in view of the manner 
of estimating m, and m,. The radius of the estimating circle in this 
instance is 1.43 times the standard deviation, and in standard units, 
the terminal radius, § =3.5/1.57 =2.23. 
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THE DISTRIBUTION OF THE QUOTIENT OF MAXIMUM 
VALUES IN SAMPLES FROM A RECTANGULAR 
DISTRIBUTION 


V. N. Murry 
Central Tobacco Research Institute, India, University of North Carolina 


When there is a priori knowledge that two samples have 
been drawn from rectangular populations with the same lower 
bounds, the hypothesis that both samples have been drawn 
from the same population can be tested by means of the 
quotient of the maximum values. The distribution of this 
statistic is derived, and its properties studied. Explicit expres- 
sions for the power function of the test are given, and the 
table of 5% values of the quotient is given for sample sizes 
up to ten. A numerical example is given. 


I, INTRODUCTION 


IDER [1] gave the distribution of the ratio of the ranges of two 
independent random samples from a continuous rectangular 
population 


1 
— for 0S28 % 
Bo (1) 


0 elsewhere 


f(z) = 


and used it to test the hypothesis that two samples came from the 
same rectangular population. The distribution which Rider derived 
would be used to test the hypothesis that two samples came from 
rectangular populations having the same range, 60, irrespective of their 
starting points, when we have a priori knowledge that the starting 
point of the populations is the same, the test proposed in this paper 
appears to be more powerful than the one proposed by Rider. 


II. DISTRIBUTION OF THE QUOTIENT OF MAXIMUM VALUES 


If L denotes the maximum observation in a random sample of size n 
from population (1) then the distribution of L is 


nOo-"L"—"dL. (2) 


To derive the distribution of the quotient of the maximum values of 
two independent random samples from population (1), we let L; be the 
maximum value of a sample of size m, and L; the maximum value of 
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a sample of size n. The joint distribution of L; and L- is then 
mndo-™™ Ly" Lo" 1d LdLe. 
From this we wish to determine the distribution of 
u = Ly/Le. 


It is convenient to regard Lz as the abscissa, and L, as the ordinate in 
rectangular co-ordinates and to transform to polar co-ordinates by 


setting 
IT, = r sin 6; Le = r cos 6. 
Then (3) becomes a function of r and 6, which can readily be reduced to 
a function of r and u, where 
u = tan @. 
The variable r can be integrated out in the usual manner, giving the 
distribution of u, as follows: 
ee ula (4) 
m+n 
udu «LSU So. (5) 
m+n 
It will be noted that the distribution is entirely independent of 4. 


III. DESCRIPTION OF THE DISTRIBUTION CURVE 


The distribution curve obviously has the positive u-axis as an 
asymptote. If m=2 the left-hand part of the curve is a straight line 
passing through the origin and has the slope 2n/(n+2). The slope 
has the value 1 when n=2 and approaches 2 as no. If m>2 the 


curve has a zero slope at the origin. 
IV. MOMENTS 
The kth moment about the origin is 


mn 


~ (m + k(n — k) 





be’ 


In particular the mean is 


- mn 
(m+ 1)(n — 1) 


U 


Mi 
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It follows that the mean will be less than, equal to, or greater than 1, 
according as n —m is greater than, equal to, or less than 1. 
If m=n, (6) becomes 


n? 
mn = se ° (8) 


If u; denotes the ith moment about the mean of the distribution, we 
have after some simplification 


n?(2n? + 1) 
(n? — 1)?(n? — 4) 
2n?(15n* + 7n? + 2) 
(n? — 1)%(n? — 4)(n? — 9) 
3n2(8n8 + 212 n§ + 95n! + 33n? + 12) 
(n? — 1)4(n? — 4)(n? — 9)(n? — 16) 

2(15n* + Zn? + 2)(n? — 4)¥? 
~  a(2n? + 1)92(n? — 9) 

3(8n® + 212n$ + 95n‘ + 33n? + 12)(n? — 4) 
ail n*(2n? + 1)*(n? — 9)(n? — 16) 





y= 





a> 





b= 





a3 





where a3 =ps/pe?/? and a4 =p4/p2”. 
It is easy to verify that 


a;—0 and @—6 as n->~, 


V. CUMULATIVE DISTRIBUTION FUNCTION 


If the probability of the inequality u2U is denoted by P{u=U}, we 
have 
m 


P{u 2 U} oe U-* if U (10) 


n 
| ~~ € F : (11) 


m+n 


Formulas (10) and (11) permit quick computation of the probability 
of getting a result as significant as the one observed. Table 1 has been 
computed from these formulas. 
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TABLE 1 
5% POINTS OF u FOR SAMPLE SIZES 310 
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3.651 
2.252 
1.778 
1.548 
1.415 
1.328 
1.268 
1.224 
1.191 


3.945 
2.410 
1.888 
1.635 
1.486 
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1.274 
1.223 1.236 
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* m denotes the size of the sample having the greater maximum value. 


If the critical region of the test is given by 
u = Ue 


where a denotes the level of significance then the above table gives the 
values of uw. for «=0.05. If the critical region of the test is given by 


u2wu, or 
Uu s U2 
then we determine u; and wz from the relation 


P{u =u} + P{u S ue} = Size of the test, 


which can be rewritten as 


1 1 
P{u =u} + Pi = i = Size of the test, 
u4U 


and observing that the distribution 1/u is the same as that of wu with 
m and n interchanged we can obtain the value of 1/wze and hence we 
from the above table. It should be remembered that the size of the 
test with this critical region is 2a and not a. 


VI. NUMERICAL EXAMPLE 


The following two samples have been drawn at random. We wish 
to test the hypothesis, that they come from the same rectangular 
population starting at zero. 


Sample 1. 0.73, 0.92, 0.07, 0.95 
Sample2. 0.92, 0.19, 0.19, 0.29, 0.38, 
0.16, 0.15, 0.84. 
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We have 
L; = 0.95, m=4 
I, = 0.92, n=8 
. u = Li/I, = 1.03 
The 5% value of u from Table 1 with m=4, n=8 is 
Us, = 1.268. 


We would consequently accept the hypothesis that the two samples 
came from the same,rectangular population. 


VII. POWER OF THE TEST 
Let the sample of size m come from the rectangular population 


1 
— f 0s2s koe 
cin. (12) 


0 elsewhere, and k > 1 


and the sample of size n from the population 


1 
— for 0S28 % 


f(x) = 4 % 
0 elsewhere 


Then the distribution of u is easily seen to be 


mn 
kmu™"'du OSusk (14) 
m+n 
mn 
k*u-"—"'du ksSuSo~., (15) 
m+n 


The most powerful test is obtained by using the likelihood ratio princi- 
ple and in this case we have the test as: 


Region of rejection: u2 U,. 
Region of acceptance: u < U, 


where U, is given by 
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Hence the power of the test is 


P{u2U.} = es if k<U. 


U.\" . 
( ) if k>U, 
k 


If o<k<1, the most powerful test is given by 


(17) 


m-a-n 


Region of rejection: us U,. 
Region of acceptance: u > U, 


Hence the power of the test is 


m k\* . 
P{usU,.} =1- ( ) if Ua>k 
m+n\U. 


n U.\" . 

= (=) if Uz<k 
m+n\k 

The test procedure and the corresponding distribution problem remain 

unaltered except for the slight change indicated below even when the 

population is given by 


(18) 


1 
f(z) =— for CS xr SC+% 


90 


=0_ elsewhere 


where C is any arbitrary constant. 
The known lower bound would have to be subtracted from each 
maximum value before the quotient is computed. 
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THE DISTRIBUTION OF THE PRODUCT OF MAXIMUM 
VALUES IN SAMPLES FROM A REC- 
TANGULAR DISTRIBUTION 


Pau R. River 
Wright-Patterson Air Force Base 
I. INTRODUCTION 


T 1s the purpose of this note to supplement the paper by Murty [1] 
by deriving the distribution of the product of maximum values in 
random samples from the rectangular distribution 


f(x) = 1, 0s281. (1) 


In order to keep formulas simpler, a unit range has been chosen. For 
while Murty’s range 4 drops out in the distribution of the quotient of 
maximum values, such is not the case for the distribution of the prod- 
uct. 


II. DISTRIBUTION OF THE PRODUCT OF MAXIMUM VALUES 


If L is the maximum value of a sample of n from the population (1), 
then, as Murty has shown, the distribution of L is 


nL*"dL. (2) 


Let L, be the maximum value of a sample of size m and L2 the maxi- 
mum value of a sample of size n, the samples being random and inde- 
pendent. The joint distribution of L; and Lz is 


mnL,"—'Le"— 1dL,dL. (3) 


From this we wish to determine the distribution of the product 
v= Ly. 

This is simply a matter of replacing, in (3), Le by L:~'v and dL: by 
L,"dv, and then integrating with respect to L, between the limits v 
and 1, since these values are the minimum and the maximum which L, 
can attain. This leads to 


mn(m — n)—v"—""(1 — v™—")dv, om #7; (4) 
ny"! log (1/v)dv, m=n. (5) 


III. PRODUCT OF k MAXIMUM VALUES 


We shall now consider the product of k maximum values, limiting 
the discussion to the case of equal sample sizes. 

To derive the distribution of the product of three maximum values 
in samples from the population (1) we replace the variable v in (5) by y 
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and multiply the result by (2), obtaining 
n®y"— log (1/y)L"—'dydL. (6) 


Since y is the product of the maximum values of two independent 
random samples and L is the maximum value of a random sample which 
is independent of either of the samples involved in the product y, 
(6) gives the joint distribution of y and L. If we set v= Ly, then v is the - 
product of the maximum values of three independent random samples 
from (1). We replace, in (6), L by y~v and dL by y~'dv. The distribution 
of v can now be found by integrating with respect to y from y=u to 
y=l1. 

Repeating this process again and again, we find that the distribution 
of the product v of the maximum values of k independent random 
samples from the rectangular population (1) is given by 


[n*/(k — 1)!]u"-*[log (1/v) ]*'do, (7) 
a result which can be established by mathematical induction. 


IV. MOMENTS 


For the distribution of the product of the maximum values of two 
independent random samples of sizes m and n respectively (m#n), the 
moment of order 7 about the origin is 


uj = mn/(m + j)(n + J). (8) 


For the distribution of the product of the maximum values of k 
independent random samples, each of size n, the moment of order j 
about the origin is 


yj’ = nk/(n + j)*. (9) 


Expressions for moments about the origin are not simple and will 
not be listed. However, it is not difficult to show that as n>, 


a3 = ps3/0*—> — V2, 


ay = py/ot > 6. 
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FIXED, MIXED, AND RANDOM MODELS* 


M. B. Wrixf and O. Kempruornet 
Towa State College 


Some explicit questions are raised regarding the adequacy of 
assumed linear models as a basis for the interpretation of the 
analysis of variance of randomized experiments. A generally 
applicable method for the derivation of a linear statistica| 
model, based on the experimental situation and the design of 
the experiment, is exemplified. The central features of the 
method are the notion of “experimental unit,” the concept 
of “true response,” and the use of randomization in the design, 
A model is derived for the case where two factors having A 
and B levels respectively, are to be examined with respect to 
a population of P experimental units, where selection of levels 
of the factors to be tested, selection of experimental units to 
be used, and the allocation of selected treatment combinations 
to units is at random. First a linear population model is given, 
based on the structure of the experimental situation, whose 
components are (unknown) parameters of the population of 
(conceptual) “true” responses. Then the conditions of the de- 
sign are imposed to obtain a linear statistical model whose 
components involve the parameters of the population model 
and some defined random variables reflecting the experimental 
procedure and design. The derived statistical model is then 
used to obtain expected mean squares in the analyses of 
variance. A second illustration of the general methodology is 
given for a more complex example originated by Vaurio and 
Daniel and discussed statistically by Scheffé. Some differences 
from Scheffé’s results are noted. 


INTRODUCTION 


URING the past ten or so years the analysis of variance of random- 
D ized experiments has been closely identified with linear models. 
In fact, in many recent publications, the technique of the analysis of 
variance is presented in such a way that its entire meaning and validity 
would seem to depend upon the accuracy of certain major assumptions 
involving a linear model all of whose random components are normally 
and independently distributed. 

It is true that the analysis of variance is very elegantly (and sug- 
gestively) justified and interpreted in terms of such assumptions. The 





* Presented at American Statistical Association meetings, Montreal, September 1954. 

The authors are indebted to H. Fairfield Smith for pointing out an error in Table 5 contained in 
an early draft. 

t+ Work sponsored by Wright Air Development Center, Contract Number AF33(616)-2145. 

t Journal paper number J-2622 of the Iowa Agricultural Experiment Station, Ames, Iowa, Project 
890. 
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" emphasis on these ad hoc assumptions, said to underlie the analysis of 
yariance, has tended, however, to obscure the following important 
items: 


(i) The assumed linear model is not a causal or mechanistic or 
functional relationship, in the sense, for example, that the ideal 
gas law, pvy=RT’, is mechanistic and functional. 

(ii) The normality assumptions are always false in an actual 
situation. 

(iii) The independence assumptions often bear no relationship to the 
physical situation. 

(iv) Many writers lean on randomization as a justification for the 
assumptions, but very few have examined in detail just what 
randomization accomplishes and how it does so. 

(v) Because the models employed are often not explicitly related to 
the experimental situation, there has been some difficulty in 
deciding just what the analysis of variance measures. 

(vi) Because objective methods for deriving models and the proper- 
ties of their components are, in general, not used, there has 
been considerable controversy, even for simple situations, re- 
garding the expectations of mean squares and the selection of 
appropriate error terms. 


We feel that any mathematical assumptions employed in the analysis 
of natural phenomena must have an explicit, recognizable, relationship 
to the physical situation. In particular, if the analysis of variance is 
to be generally useful in the interpretation of experimental data it is 
necessary that its meaning and justification should transcend the set 
of arbitrary assumptions which are usually put forth. 

Largely because explicit and objective methods for obtaining the 
appropriate model for a given experiment are not generally used, one 
finds that certain rules and results, concerning expectations' of mean 
squares and the choice of error terms, given by Mood [10; p. 349], 
Hald [6, p. 480], Mentzer [9], and Scheffé [13] are contradicted by 
Kempthorne [7; p. 574], Anderson and Bancroft [1; p. 341], Tukey 
[14], Cornfield [4] and Villars [17]. 

It is the main purpose of the present paper to illustrate a more objec- 
tive procedure for the derivation (definition) of linear models to be 
used as a basis for the analysis of variance. The emphasis is directly on 
the experimental situation and design. The process of randomization, 
implicit or preferably explicit, plays a central role. 





1 Here “expectation” is used in the standard sense of average value over an indefinitely large number 
of repetitions, 
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RELATION TO OTHER WORK 


We shall attempt to outline very briefly the relation of this paper to 
other relevant work. The pattern of the investigations from which the 
present paper stemmed is a natural generalization of the methods of 
“finite model analysis” given by Kempthorne [7]. The central features 
of both the philosophy and detail are the concept of “experimental 
unit” and the use of randomization in experimental designs. The de- 
pendence on the classic and extensive work of Fisher and Yates de- 
scribed in texts on experimental design and on the work of Neyman 
et al. [11] will be evident. 

The results given in Table 2 (which involve a so-called finite popu- 
lation correction factor) valid for a two-factor experiment in a com- 
pletely randomized design under the restrictive conditions that unit- 
treatment interactions are negligible correspond formally to formulas 
given by Tukey [14] in 1949, Cornfield [4] in 1953, and Bennett and 
Franklin [2] in 1954. It should be pointed out, however, that the 
general results, without the restrictive assumption of negligible unit- 
treatment interactions, given in Table 4, do not correspond. This would 
indicate a difference in approach in the present work. In order that con- 
fusion may not arise as to the logical basis of the various formulas 
which look alike the following remarks are presented. 

As we understand it, Tukey’s [14] results are based on an assumed 
linear model. Tukey made explicit the idea of independent sampling of 
rows and columns, recognized that the interaction term is not inde- 
pendent of main effect terms and deduced expected mean squares 
which include results for random, mixed, and fixed factors. Cornfield’s 
[4] results are based on a well-defined sampling approach, and are 
applicable to the situation where a population is divisible into an 
RXC array of primary sampling units with a constant number of 
sampling units within each primary unit. A sample is obtained from 
the intersections of r rows and c columns taken at random with n 
sampling units for each selected primary. Bennett and Franklin’s [2] 
results appear to have a basis similar to Tukey’s. 

In view of the facts that: 


(1) there is no explicit attention in these particular citations to the 
“experimental design,” by which we mean (following Fisher, 
Yates, Cochran, etc.) the permissible randomization patterns 
for the application of treatments to experimental units ;? 





2 The experimental unit (following standard usage) is not restricted to be a physical entity such asa 
plot of land. It may be more abstract such as a unit of time or a set of conditions. Operationally it is 
that entity to which treatments are assigned at random, 
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(2) these authors do not give a linear model whose properties are 
consequences of the choice of experimental material and of 
treatments and of the allocation of treatments to experimental 
material at random; 

(3) the general results exemplified in Table 4 are different; 


it seems clear that the approach given in the present paper is different 
from that of Tukey [14], Cornfield [4], or Bennett and Franklin [2]. 

The work reported in this paper is part of an investigation (mainly 
supported by Wright Air Development Center) into the meaning and 
derivation of linear models and their use in the analysis of randomized 
experiments. Other results have been given in [21]. 


THE DEVELOPMENT OF A LINEAR MODEL 


To illustrate our view on the meaning and derivation of a linear 
statistical model we will consider in this section a simple experimental 
situation and design, stated in general terms, under certain idealized 
conditions. 

Suppose we have two factors, -4 and B having A and B levels re- 
spectively, which we wish to study with respect to a given population 
of P experimental units. Suppose that the experiment consists of 
selecting at random 


a levels of <4, 
b levels of B, 


p = rab experimental units, 


and applying the selected ab treatment combinations at random to the 
selected units so that each selected combination appears on r units. 
This describes a two factor experiment in a completely randomized 
design and should not be confused with a randomized block type de- 
sign. 

The situation as described is rather general in that if A=a, B=b 
then we have a “fixed model” situation; if A>>a, B>>b then we have 
a “random model” situation; if A=a, B>>b then we have a “mixed 
model” situation, etc.* 

Before going on to a derivation of the model for this situation we 
digress to discuss briefly the notion of experimental error. One might 
describe it, roughly, as the failure of repeated observations, made 





3 We use the symbol “>>” to denote “much larger than.” “~ 
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under similar conditions, to be identical. Causes for this might be 
classified as 


(i) unit errors—due to failure of different experimental units to 
yield identically under the same conditions of treatment; 
(ii) treatment errors—due to inability to duplicate a treatment and 
conditions of its application exactly; 
(iii) measurement errors—due to failure of repeated measurements 
on the same physical situation to correspond exactly. 


This categorization is neither rigorous nor exhaustive, but is suggestive 
of a real distinction between types of errors. 

The unit errors must be regarded as fixed quantities associated with 
particular experimental units. On the other hand, the treatment and 
measurement (henceforth called technical) errors may often be repre- 
sented mathematically as random variables. 

To simplify our discussion in this section we will assume that all 
technical errors are negligible. We can then at once conceive of the 
existence of a number which would be the response if a particular 


treatment combination were applied to a specified experimental unit. 
Thus if 


i = 1,2,---, A denotes the level of <4; 
j = 1,2, ---, B denotes the level of B; 
k = 1,2, ---, P denotes the experimental unit; 


then we conceive of a number Y;;, which would be the true response if 
the kth unit were subjected to the treatment combination consisting 
of the ith level of «4 and the jth level of B. We note that the set 
{Ys} represents the conceivable totality of knowledge which might 
be experimentally acquired in this situation. 

The following relationship is an algebraic identity: 


Vine = w + as + d; + (ab)i5 + ee + riz, 


where the quantities on the right have the following definition and 
physical interpretation: (we use the usual dot convention to denote 
means) 

u=Y...is the conceptual over-all mean response from all treatment 
combinations on all experimental units; 

a;=(Y;..—Y...) is the difference between the mean response from 
all units when subjected to the ith level of -4 in combination with each 


level of B, and u. We call a; the main effect (or simply effect) of the ith 
level of -4; 
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b;=(Y.;.—Y...) is, similarly, the effect of the jth level of B; 

(ab) j= (Yu.—Y.;.—Yi..+Y...) is the difference between the effect 
of the ith level of ¢4 at the jth level of B and the effect of the ith level 
of eA averaged over all the levels of B. We call (ab), the interaction 
effect (or simply interaction) of the ith level of -4 and the jth level of B; 

e=(Y..4—Y...) is the difference between the mean response from 
all treatraent combinations on the kth experimental unit, and ». Thus 
e, measures the deviation of the kth unit from the average of all the 
units, with respect to the average of treatment combinations. We call 
e, the additive error of the kth unit; 

nize=(Y sie— Vij. —Y..2+Y...) measures the difference between the 
deviation of the kth experimental unit from the average with respect to 
treatment combination (77), and ex. We call n;j the interactive error 
of the kth unit and treatment (77). 

We refer to the above relationship as the population model. It is 
based on definition and does not imply a mechanistic or causal picture 
—in fact, the various “levels” of the factors are not distinguished 
quantitatively in the statement of the model. We call particular atten- 
tion to the fact that the definitions of the main effects of each factor 
d.pend in general on the levels of the other factor, and on the experi- 
mental units, which are included in the experimental situation and 
design. It is only when our scale of measurement is such that the factors 
and experimental units are mutually additive (i.e., all interactions are 
zero) that the preceding sentence is too restrictive. 

To simplify the initial presentation somewhat we will now make the 
assumption that the n,;, are all zero. (The assumption is not trivial, 
especially since there is no structuring of the experimental units, but 
even if untrue, it will not, in many cases, affect the interpretation of 
the analysis of variance too heavily, as will be evident from the fol- 
lowing section.) Our population model then becomes 


Yin = w+ a; +d; + (ab)s5 + ex. 
We note that, by definition, 


La = Lb = DX (ab)is = LL (adi) = Lies = 0. 


Now, in the actual experiment we observe a randomly selected sub- 
set of the { Yi}. Let the indexes 7*=1, 2, - --a and j*=1,2,---,6 
denote the randomly selected levels of -4 and 8 respectively in order 
of their selection. Thus, for example, 7*=1 corresponds to some value 
of i=1%;. We make the convention, however, that if A=a then 7* and 
i are the same index and similarly for the case B =b. 
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Let x,*;*; denote the observation of the fth replicate to which treat- 
ment combination (7*j*) is applied, where f=1, 2,---, r for each 
value of (i*j*). We note that to each (7*j*f) there corresponds a par- 
ticular experimental unit, i.e., some value of the index k. To write an 
explicit model for the z,*;*;, it is convenient to introduce some addi- 
tional definitions and notation. 

Let 


= if 7* corresponds to 7, 
0 otherwise; 
6; = 1 if j* corresponds to j, 
= 0 otherwise; 
pS = 1 if (7*7*f) corresponds to k, 
= 0 otherwise. 
The a’s and #’s are similar to the dummy variables used by Cornfield 
[3] in sampling theory. The p’s are random variables which specify 
how selected treatment combinations are assigned to experimental 
units. The quantities can be treated as random variables because 
random methods of selection and allocation are employed. From the 


design of the experiment, certain distributional properties of these 
quantities are easily obtained, for example: 


1 
Prob Ff = 1 <->; 
rob (a ) 7 


1 


Prob (a;* = 1, a;*’ = 1) = ——_——_» 
A(A — 1) 


ii, it Fi*; 


the {a;‘*}, {B,*}, { px‘***} are groupwise independent, etc. 


An explicit model for z,;*;, under the simplifying assumption that 
there is unit treatment additivity, i.e., all n:;.=0, is then given by 


aieity = w+ Doaias + 2) BM; + Do ai*B,*(ab)i; + DE peer. 
‘ i ij k 


This relationship we shall refer to as the statistical model. It is of inter- 
est to note that the population model derives from the experimental 
situation, and the statistical model is then obtained by imposing the 
conditions of the design of the experiment. The random variables in 
the statistical model are, of course, the a;*, 6;* and pei iS, which take 
on the values 0 and 1 with known probabilities. All other quantities 
in the model are fixed, unknown, parameters. 
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This development of a model is rather general. For example, if 
A=a, B>>b then we have a “mixed model” situation with factor 4 
fxed and factor B random. Then we take 7 and 7* to be identical 
indexes. Thus 


> a;"a; = as, and ps a; (ab) i; = (ab) #;. 


Hence, the mixed model becomes 


rity = Viste = wt a; + DBD; + DBM (ab)ig + DE pnien. 
i j i 


THE ANALYSIS OF VARIANCE 
The algebraic structure of the analysis of variance for this design is 
given in Table 1. We use the usual dot convention to denote means. 


TABLE 1 
ANALYSIS OF VARIANCE 








A (a—1) A’ =br p (x*..—z...)%  A*=A’/(a—1) 
B (b—1) B’ =ar ml (2;*.—z...)2 B*=B’/(b—1) 
AxB (a—1)(b-1) T'=r y (xisj*. — ai". 1* =I'/(a—1)(b-1) 


—Z.;*. +2.. .)? 
Residual ab(r —1) R’ = D> (axis —259;*.)? R* =R'/ab(r —1) 
i*;*7 


Total abr —1 G’= > (x*;*;—z...)? 
i*j*s 





Using the derived statistical model and some distributional proper- 
ties of the {ai}, {6,7} and {p:*#/}, the expectations of the mean 
squares can be readily obtained. The algebra is exemplified in the 
Appendix. The results are given in Table 2. The following notational 
definitions are used: 


o. = —— ) a; 


(A ai i 
oo, = ——_ > b; 


aa j 


1 
> (ab);;? 


ae —1)(B-1) ij 


deer? 


“es k 
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TABLE 2 


EXPECTATIONS OF MEAN SQUARES UNDER ASSUMPTION 
OF UNIT-TREATMENT ADDITIVITY 





ee 





Mean Square Expectation of Mean Square 





—_——_. 


(B—b) 


A® oe+ roa? +rbo,? 


B* oe oo roast +rac? 
o2+1oa* 
R* oe 





We see from this analysis that randomization “controls” the unit 
errors, in a statistical sense, and enables the estimation of certain de- 
fined “components of variance,” as well as the error of estimates of 
contrasts among treatments actually employed. Furthermore, tests of 
significance are made possible by use of randomization tests. 

We employed the assumption of additivity of treatments and experi- 
mental units. If the unit-treatment interactions are not negligible then 
as will be seen in the following section the general effect will be to 
underestimate significance and overestimate errors. 

We now turn our attention to the specification of “proper error 
terms.” The present use of the phrase is exemplified by the following: 


E,* is said to be the proper error term for <4 main effects if E,* is 
a linear combination of the analysis of variance mean squares such 
that E(E,*) = E(A*)—rbe,?, where by E(E,*) we mean the expecta- 
tion of £,*. 


The random variable E,* will then have the properties that 


(i) an unbiased estimate of ¢,? is given by a multiple of (A*—£,*); 

(ii) a meaningful “F-type” statistic, for the test of significance of 
cA main effects is given by A*/E,*; 

(iii) a multiple of Z,* is an unbiased estimate of the variance of the 


estimate of any linear contrast of the effects of levels of e4 actu- 
ally used. 


The appropriate multiplier in (i) and (iii) will depend on A, B, a, b, 
and the particular contrast. 

The proper error terms, with unit-treatment additivity, for the gen- 
eral situation under consideration are given in Table 3. 
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TABLE 3 
PROPER ERROR TERMS WITH UNIT-TREATMENT ADDITIVITY 








——_— 


Category Proper Error Term 


b (B—b) b 
_ *_ p*)=— #1... ps 
cA effects I ; (I* —R*) 3 1 +5R 





e_% (ye +) _ (4-9) 04” pe 
B effects I z R*) rr I*+7 Rk 


Ax B interactions R* 





It is easy now to examine some common, important, special cases. 

(a) Fixed Model: A=a, B=b. 

The proper error term for each of -4, B and <4 XB is R*. 

(b) Random Model: A>>a, B>b. 

Under these conditions a/A and b/B approach zero and the proper 
error term for <4 and also for B is I*. 

(c) Mixed Model: A>>a, B=b. 

Here <4 is random and 8 is fixed. The proper error term for the 
random factor <4 is R*. The proper error term for the fixed factor B 
is J*. 

Cases (a) and (b) correspond in a certain sense to Models I and II, 
respectively, of Eisenhart [5]. 


RESULTS UNDER GENERAL CONDITIONS 


The analysis of the two-factor completely randomized experiment 
given in the preceding sections was based on the simplifying assumption 
that the interactions of treatments with experimental units were 
negligible, i.e., that nij,=0 for all ¢, j, k. This assumption was em- 
ployed to simplify the exposition of the general method, which is, of 
course, independent of the assumption above. In this section we indi- 
cate the more general results. 

We note that Dou Nik= > Nie =O. 

The general statistical model is 


ritity = a + Do asa: + 2) BM; + Di a8 (ab) a5 + Le ower 
i j ij & 
+ Do ai" BS pF ni. 
ijk 


The analysis of variance is as given in Table 1. The expectations of 
mean squares under the more general conditions are given in Table 4. 
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In addition to the notational definitions given previously we use 


1 
2 3 = isk 
JP ~ Fone sa @-neay > 
_ 1 
S a ik?s Qatn? = 
Qn 20-0 Qa! = CB -DP oT 


(B — 1)(P — 1) 
* Zz (nijx - 4 = 1. ix)*, 


ijk 











TABLE 4 


EXPECTATIONS OF MEAN SQUARES 
(MORE GENERAL CONDITIONS) 








Mean Square Expectation of Mean Square 


A * o,2 +o,? -—— r Cab? -> Qent | ao rb [ « -5; Qu? 





(A —a) 


Bt ont toe + zp’ [out Qn | +ra [ ot >} Qs. 


1 
I* o,+o2+r [ oa ~ Pp Qabn? | 
R* o? +o,? 





It will be seen from Table 4 that, in general, unbiased estimates of 
4”, oy, and oq” cannot be obtained from the analysis of variance mean 
squares, and that proper error terms do not exist. However, the “bias” 
is of order 1/P, where P is the number of experimental units in the 
population. 

Thus, for example, if E,* is the error term for c4 effects given in 
Table 3, then the bias in using 


(A* — E,*) 
rb 
as an estimate of ¢,? will be [(—(1/P)Q.n?]. In many cases this bias will 
be negligible. 
A MORE COMPLEX EXAMPLE 


To apply these notions in a more complex situation and to contrast 
the procedures with those given by other writers, we will consider an 
example given by Vaurio and Daniel [16], ard discussed in detail by 
Scheffé [13]. 


The example concerns the investigation of several sources of vari- 
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ation in the pack life of cans in a prune pack test. The fundamental 
units as manufactured and treated are long coils of thin steel. Impor- 
tant variation is expected from coil to coil and also from end to end 
within each coil, the latter variation being due, apparently, to system- 
atic variation in the preparation of each coil. The experiment is 
intended to investigate these sources of variation as well as that due 
to several different levels of annealing (a treatment applied to an 
entire coil) on the pack life of cans, under certain standard conditions. 

Suppose then that there are A levels of anneal to be investigated, 
and that L definite locations (say head, middle, tail, etc.) are distin- 
guished on all coils. 

The experiment involves the (random) selection of Ac coils from 
some large (perhaps conceptual) population of C such coils. We then 
apply each of the A anneal levels to c of the selected coils, using random 
allocation. 

From each location on every treated coil are selected at random s 
specimens of metal. These are made up into cans, filled, and time to 
failure observed. 

We will suppose for simplicity that all locations on each coil are of 
equal size, say each containing S possible specimens. 


- + +, A denote the anneal level; 
- ++, C denote the coil in the population of coils; 
- ++, L denote location; 
, 2,: +--+, S denote the specimen for a given location on a 
given coil; 

,* ++, € denote the coils as randomly allocated to each 
anneal level (i.e., for each value of 7, j* takes on values from 
1 toc). 

m*=1, 2, - +--+, s denote the randomly selected specimens for each 
location on each selected coil, i.e., for each value of (1j*k), m* 
takes on values from 1 to s. 


Now let us imagine that the conditions of test, of measurement, and 
of application of treatment are precisely standardized. Then we can 
conceive of a number Yijxm which would be the pack life of the mth 
specimen at the kth location on the jth coil when the coil has been 
subjected to the ith anneal level. Then we can write the identity 


Y ijem = w+ at + aj + aijAe + on! + ant” + aj” 


+ a jx4O” + jem + aij’, 
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when 


nw = Y.... is the “true” over-all average pack life; 
aA=(Y;...—Y....) is the effect of the ith anneal level; 
aj=(Y.;..—Y....) is the effect of the jth coil; 
a;AC=(Yi;..-—Y;...—Y.;..+Y....) is the interaction of the ith 
anneal and jth coil; 
a, =(Y..4..—Y....) is the effect of the kth location; 

antl =(V¥i4.—Y;...-—Y.%.+Y....) is the interaction of the ith 

anneal and the kth location; 
anAL=(Y.,.-Y.;..-—Y..4%.+Y....) is the interaction of the jth 
coil and the kth location; 

ijn ACh = (Vin. — Yiz..— Yer. + Yi... —-YV gp tY¥.j.. 4+ Y.4.—-Y....) 
is the interaction of the 7th anneal, jth coil and kth location; 

Ojkm® = (Y .kjm— Y .j.) is the within coil and location “error” asso- 
ciated with mth specimen; 

Cts jem4S = (Vijem— Yijn.— Y -jem+Y .je.) is the interactive “error” of 
the ith anneal with the mth specimen within the kth loca- 
tion on the jth coil. 

To simplify the algebraic treatment somewhat we will assume that 

the interaction terms {ajjx4°4} and {asjim45} are negligible. Scheffé 
assumes these interactions as well as the {a;4° } are negligible. We 


shall however carry the anneal-coil interactions to observe their effect 
on the analysis. Thus our population model becomes 


Vijem = wet aed + aj Hb arg A? + arnt” + aan,O” + axjem’. 


Of course we know that the conditions of test for cans are not en- 
tirely standard and hence, if we tested the can from the mth specimen 
in the kth location on the jth coil which was treated with the ith anneal 
level we would not observe Yijim. We will assume here that we may 
treat the conceptual observable, y;jrm, 28 a random variable such that 


Yijem = Yijem + €ijem; 


where the ¢;j:m are mutually uncorrelated random variables, with 
mean zero and constant variance o”. 
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Let 2;j*sm* represent the observation obtained from the m*th speci- 
men at the kth location from the j*th coil which was treated with the 
ith anneal level. Define the random variables 


p;** = 1 if (7j*) corresponds to 7 
0 otherwise 
Esj*tem't*™” = 1 if (ij*km*) corresponds to (1j*km) 
0 otherwise. 


Then 
1 


Prob (p;** = )1 = rk 


Prob (p;i#* - 1, pj" i" _ 1) -_ (4j* 4 +j*") 


1 . 
c(c —1)’ 


1 
Prob (£éj%emi**™" = 1) = rh 
the {p;*} and { £:;*im‘**m*} are independent, ete. 


We can then write the following model for the experimental ob- 
servable: 


Tijtem? = wt aA + DY) Bia? + DS pj? + an” + ond” 
j j 
AD pit erjl® + De) p55" Eijtemi?*™* (xin + eijnm)- 
j jm 


If we put 
die. 
aie? = Dy pji*a;°; 
j 
57? 
a;;4¢ = ) ™ pai Ae; 
j 
Le : 
Qi jf = ps p;* aejne®; 
j 
:j* ij*km® ™ 
Asem = Do 15 Essent ™ ams; 
jm 
:j* :j*km* . 
€* stem? = Dy i? Eiitem es iam} 


jm 


then, 


Lijtem? = p+ at + aie? + aise? + an? + arth + 055% 0% 


Fijtem® + €¥ij*em*. 
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This last relationship is notationally similar to the linear mode] 
assumed by Scheffé [13], but some important differences in properties 
exist, for example Scheffé assumes that the random variables a;;*,°" are 
mutually independent, while from our derivation of the model it follows 
that 


De ain? = z( )y Piatt) =) nir( oe ant) = 0. 
k k j j k 


We give in Table 5 the expectations of the analysis of variance mean 
squares obtainable from the statistical model we have derived. The 
following notational definitions are used: 


1 rm 
) » (a;4)?; oo? = Fn Xu (a;°)?; 











ci: =) 4 
ox? = i Ta; ead = CET =p E owt’ 
at = WT =z E eo 
ale x a aay & 0s 
os? - ae GS Dy LD eand*s 9? = Bleinm 


For each of the above definitions, with the exception of oc’, the 
divisor is selected as follows: if the component is based on the sum of 
squares of N quantities which are subject to n linearly independent 
linear relations, then the divisor is (V—n). According to this criterion 
we could define 


1 


ut aie ’ 





for the anneals by coils interaction component; we have defined oac’ 
with a divisor of A(C—1) because this fits a pattern which simplifies 
the statement of more general results. If the reader prefers, he may re- 
place gc” in Table 5 by 


(A — 1) 
A 


, 
CAC’ 7 











de] 
les 
are 
Ws 


un 
ne 


a 
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TABLE 5 
EXPECTATIONS OF MEAN SQUARES 





























DvE TO d.f. M.S. Exprectep MEAN SqQuaRE 
S- A 
etd Abu AF be . ad ( in ae) Leeac? 
+Lsoc?+cLso4? 
en, 
Coils (?) Ate-1) ce 4 ; D tena +bend 
Locations (L ) (L—1) L* o? + os ost+ od = soci? +Acsoz? 
Axl (A—1)(L—1) TIax* eae ) os?*+secr?+cso4L? 
Cx A(e—1)(L—1) Tex* roe os*+socx* 


Specimens (5) AcL(s—1) S* o*+o5* 





The coefficient of o,4c*’ in the expected mean square for anneals is 


1 c 
eae 
A ¢ 
and for coils is ((A —1)/A)Ls. 
In this example S>>s, and we treat the coils actually used as a smali 


random selection from a large population so that C>>Ac. Hence, 


(S — s) Ac C — Ac 
Ce ea es oe 
S (A — 1)C C 


may be taken as equal to 1. Thus we see that independent of any 
assumptions regarding the anneal-coil interactions, we have that 
E(C*) = E(A*) —cLso4?; and so C* is an appropriate error term for the 
evaluation of the effects of the anneal levels in this experiment inde- 
pendent of any assumptions about the {a;;4’}. 

From our analysis we would thus list the following “proper error 
terms” : 


Component Proper Error Term 
ca’ c* 
ge? +oac" S* 
a1 Icx* 
car? Ie1* 


ocr? S* 
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For the same example, Scheffé uses‘ the assumed linear model 
Lijtem = w+ ad + aigee + an” + arnt” + aeigee? + Gij*emeS, 


where the a’s are taken to be fixed constants, and the a’s are assumed 
to be normally and independently distributed random variables, with 
means zero and constant variances within groups. We reproduce in 
Table 6 Scheffé’s expectations of mean squares. 


TABLE 6 
EXPECTATIONS OF MEAN SQUARES, FROM SCHEFFE [13] 











Mean Square Expectation 
A* os?+8secr?+- Lsoc?+cLsc4? 
c* os'+8ec1? + Lsoc* 
L* og'+80c1?+Acsoz? 
Tat* os*+8oc1?+cscar* 
Icx* oc? +8ec1? 
S* os? 





In view of the differences between Tables 5 and 6 the reader may ask 
“who is right?” There can be no simple answer to this question. All 
one can do is to use methodology with the most acceptable assumptions 
and the choice is between the logic of procedures rather than simply 
between results. Our general views on the basis for choice of a statistical 
technique are given above and below; our aim in this section has simply 
been to present the details of our approach for an example to which 
another point of view had been applied. 


CONCLUDING REMARKS 


The subjects of this concluding section fall into three categories: 
first, advantages as we see them of the procedure which has been illus- 
trated heretofore in this paper; second, some views on problems of dis- 
tribution; third, some brief remarks on where and why the various 
types of models are appropriate. 

The following advantages accrue to the procedure suggested in this 
paper: 

(i) An explicit connection is given between the physical situation 
and the components of the model. Thus there is no doubt that 
the statistical model is definitional rather than mechanistic (or 
in Tukey’s [15] language, is tangential rather than functional). 





* We have adjusted Scheffé’s notation somewhat to allow easy comparison with the present results. 








2. 


h 
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(ii) It is made clear that the meaning of the main effect of a certain 


(iii) 


(v) 


(vi) 


level of a factor depends not only on the other levels of that 
factor but also, in general, on the levels of all other factors 
which are involved. 

An explicit definition is given of interaction, and hence some 
insight is provided into the fact that the much used assumptions 
of additivity® of factors and of additivity of treatments with 
experimental material depend not only on the mechanics of the 
experiment but also on the scale of measurement. 

This approach forces on the experimenter and the statistician 
the fact that it is possible to draw inferences statistically only 
about populations from which samples are drawn according to 
probability considerations—i.e., it is the design and procedure 
of the experiment that determines the scope of the statistical 
inference, and extensions of such inferences to wider circum- 
stances cannot be assessed probability-wise. 

The procedure we have used is hinged to the expr-rimental situa- 
tion and design rather than around a set of arbitrary assump- 
tions necessary to justify some esoteric distribution theory. 
Thus attention is focussed on the physical interpretations and 
distributional assumptions are relegated to their proper role of 
serving as a guide in obtaining mathematical approximations. 
Finally, general formulas may be obtained by this procedure 
which are valid for all possible situations of which “all fixed” 
and “all random” models are extreme special cases. On the other 
hand, the usual approach treats fixed and random models quite 
differently and gives no clue on how to write a model for cases 
where the factors are neither fixed nor random. 


There are many distributional problems associated with analysis of 


variance tests of significance which are not hinged on a priori normality 
assumptions. (There are also problems of distribution even when the 
whole battery of normal theory assumptions is made.) Two viewpoints 
suggest themselves. One can use the approach illustrated in this paper 
to derive the appropriate model and specify certain inherent properties 
of the components of the model, and then “superimpose” a sufficient 
number of assumptions in order to obtain so-called “exact” tests. 
Alternatively, one may rely on the fundamental notion of randomiza- 
tion tests, and use as approximations to randomization distributions 
certain procedures suggested by normal theory. (See Welch [18], 
Pitman [12], Kempthorne [7, 8], Wilk [19, 20].) 








5 We use the word “additivity” in the sense of “negligible interaction effects.” 
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We prefer the latter viewpoint of mathematical approximation to 
the former of statistical assumptions because (i) it is less likely to ob- 
scure the tenuous relation between the physical situation and mathe- 
matical or statistical abstractions; and (ii) it faces honestly the fact 
that more than desire for mathematical elegance is required to validate 
statistical assumptions. (These remarks are not intended to depreciate 
normal theory procedures, which we believe serve very importantly 
as general suggestive guides in statistical techniques.) 

The final paragraphs concern the appropriateness of various types 
of models. It is clear that the model must reflect the experiment as 
actually carried out, and not any desire to make a broad scientific 
generalization. This, however just pushes the question back to the 
experimental design. The following experimental situations may be 
distinguished. 


(i) Even though a factor has a well-defined (say quantitatively) 
array (or continuum) of possible levels we choose to examine a 
“representative” set, e.g., in studying the behavior of a process 
in a temperature range say from 100-125°C we choose to study 
it at 100°C, 105°C, - - -, 125°C. This factor, temperature, is then 
to be regarded as fixed. We may of course have prior information 
which enables us to interpolate between the temperatures actu- 
ally used. The analysis of variance may then be used to assess 
the adequacy or reliability of the experiment and perhaps also 
in connection with functional analyses by attempting to find 
scales on which interactions become negligible. 

(ii) A factor may have a definite number of qualitatively distin- 
guishable categories and we have the interest and resources to 
study all of these. For example, we may test a definite number 
of types of vacuum tubes for a given application. Again, this 
factor is regarded as fixed. In this situation, functional analysis 
may be more difficult or of little interest. 

(iii) Resources may not permit us to examine all possible “levels” of 
a given factor, and we examine a randomly selected subset to 
get an estimate of the variability of these levels. For example, 
we may wish to test the lethality of an antibiotic with respect 
to a bacterium which has 50 genetic strains. Our experiment 
involves the random selection of 10 of these strains. This is a 
case where the general development we have given would be 
applied. One might refer to this factor as “semi-random” in 
order to preserve the usual terminology. 
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(iv) We may have a large population of operationally distinct 
entities (such as pilots, carloads of oil, blocks of land, etc.) 
about which we wish to make some inference, and cur experi- 
ment involves the random selection of a comparatviely small 
sample of the large number of “levels” of this “factor.” Such a 
factor is then regarded as random. 

(v) There are some circumstances under which it may be useful to 
consider the levels of a random factor actually used as though 
they were the levels of a fixed factor (with a corresponding 
redefinition of main effects and interactions), but there appears 
to be no objective basis for the converse case. 


APPENDIX 


The object of this appendix is to exemplify the procedure by which the ex- 
pectations of mean squares are obtained from the statistical model. We shall 
outline this derivation for the mean square and the residual mean square of 
Table 1, under the simplifying assumption of unit-treatment additivity. 

Consider the residual sum of squares 


R! = Do (xirity — 2i0j*)*. 
i*y*s 


Substituting the statistical model we find that 


-*. 1 ac yy 2 
R’'= > ( De een — — ter) 
i*j*s k rT fk 
2 1 seis 2 
= 2 [ p ( L per) ——_ per) | . 
i*;* ¥ k r fk 


Now (ox‘i"fps**'7) =0 with probability 1, for ff’. and (px‘i%p, "i =0 with 
probability 1, for kk’, and any “p” variable is equal to zero or unity. Hence, 


*% 1 + % 1 +4 -* o%, 
ti = } > pk’ P te,2 _ 2. pr P Se, _ pM pk‘ ? f py i reser | . 
sof. sh Tr fk r ff’ 
y ; kk! 
Taking expectations, 
1 1 1 1 1 
E(R’) = —e? — — ome Gal <n am Le 
(R’) | ¥ > pm >* Pr Lea | 
kek! 


Tt ‘I Tr fk 





_ (r — 1) (r — 1) 
= ab > ex? ——_— ™ cur | 


P ~ P(P — 1) ow 
But >-.e.=0. Hence 


,¥ exer = — >, ex’. 


kk’ 7 
So, 
se ab(r — 1) / 
E(R’) - bP LD (P 1+ 1) 2 et. 


= ab(r — 1)e.? 
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Dividing by the degrees of freedom, we obtain E(R*) =, as given in Table 2. 
Turning now to thec/4 sum of squares, in Table 1, 


A'=br  » (aie... — 2...)? 
= br » [( »» aa; — ~ 2 aia) + - 2 ax” B* (ab); 


1 
én i*9 3° wh) +(— i*j* 
&, ay B; (ab).;) ( br p> pre? 7ey 


1 2 
on agen i*i*7 
abr a,” a)| 
= br dX ¢ (Au* + Aaxi* + Axi*)*, say. 
All products such as A,*Ax* have expected value 0. For example, 


Casi - 
E(Ay*Ag*) = E( E aia: - — F a a) 


1 1 
it ams Pia ant: meen >» i*j*7 ) 
Pk ek Pk ek 
(; i*fk abr s*j*sfk ’ 


and 


1 
E( aa; ete) = — aye 
es a é Pk hk x, AP k 
= 0, since a;** and p,**i¥ are independent, 


and > .e.= > :a;=0. Similarily, the other cross products in Ax*As* have ex- 
pectation 0. 
Thus, 


B(A’) = brE[ 3 (Aust + Ane? + Ane?) | 


Now, 


B( & Aw!) = #[ F( Eau) -2( Ewa) '] 


1 1 , 
= £ » a;* a3? = © » a az— — 7 aay” ayay?, 
a a i*mi* 
ips’ 


by the same reasoning as in dealing with R’, 


(a — 1) a(a — 1) 
-—j +*- Gann & 





= (a — 1)e,?, since  ¥ a; = 0. 


Fix 


Sim 








FIXED, MIXED, AND RANDOM MODELS 1165 
Similarly 


#(& Ax?) - zr} = (+ 5 BF (ab)is — = &, ai“Bi"(ab)a] | 
- z[ Z(FE aaa 


1/1 i 2 
7 a ( b &, - B, (ab)ss) ] 
We have 


[ 2 ai*B,/"(ab)si | = » [ u ai*Byi*(ab)ai | 
+ ELE atarcadds |[ Zao" Cady | 


i. 


=) X a6," (ab);;2 + de Dd ai**B,i*a** 8,5” (ab) sj(ab) 7. 
i i* i" ‘ 
ivi? 


The expectation of this is 


17 oo ~— b (B-») u 
a (abut AB(B — 5 E (abut = AB (B—1) y 2 (ab)si* 


Also 
[% 2X ai"B, (ab) | = ZL ae ai *B,*(ab):? | 
+ DX asi “ea | Eat arab] 


one 
> [ 2X a;*p,i “(ays |[ L ait "Bi" (ab)as | 
&. 


[Ewe ys |[ X eB" (ab), | 


isnt 
i*i*’ 


The expectation of this is 
a(a — 1) b(6 — 1) a(a — 1)b(b — 1) 
[ 2, (abs ] [a - A(A— 1) AB(B—1) A(A—-1)BB-1 
ab (A —a) (B — b) 
“ 2X (0b) (A-D) Bo) 








Combining terms we have 
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In the same way we find that 


E(Ay*) = ea 1) oe". 


Hence 





B(A’) = br[ (a — est + 2 Oe ab e_” «| 


b B 
— b) 


B 
= (a _ [ot + Eo” 


Dividing by the degrees of freedom, (a—1), we find 
(B — b) 


E(A*) = o,? + —" row? + bro’, 


Toa? + ast | . 


as given in Table 2. 
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COMPONENTS OF A DIFFERENCE BETWEEN TWO RATEsS* 


Evetyn M. Kitagawa 
University of Chicago and Scripps Foundation 


HEN comparing the incidence of some phencmenon in two or more 

groups, social researchers place much emphasis on the need for 
holding constant those related factors that would tend to distort the 
comparison. For example, before comparing the death rates for the 
residents of two areas, demographers frequently control the factors of 
differences between the areas in age, sex and race composition. A tech- 
nique commonly used to accomplish this is “standardization” of the 
rates for the two areas by relating them both to a standard population 
with specified age-sex-race composition. By applying the schedule of 
age-sex-race specific death rates for each of the groups to the age-sex- 
race composition of the standard population, then noting the total 
death rate that results, it is possible to compare the death rates for the 
areas with reasonable confidence that differences in age, sex and race 
composition do not explain the differences between the rates for the 
areas that still remain after they have been standardized. Controlling 
the effect of related factors by this method is termed direct standard- 
ization.} 

It is often noted that such standardized rates are “artificial.” For 
example, an age-sex-race-standardized death rate indicates what the 
total (or crude)? death rate of a population would be 7f it had the age- 
sex-race composition of the standard_ population while retaining its 
own age-sex-race-specific death rates. While such a measure may not 
be very useful for descriptive purposes, it is an important analytical 
device. 

Since the crude rate of any population is its “real” or “observed” rate 
in the (descriptive) sense that it is the total rate which results from 
the particular composition and specific rates which prevail in that 
population, a systematic statement of relationships between crude and 
standardized rates for two or more groups may help to bridge the gap 





* Expanded version of a paper read at the annual meeting of the American Statistical Association 
held in Chicago, December 27-30, 1952. The preparation of this manuscript was sponsored jointly by 
the Population Research and Training Center, University of Chicago, and Scripps Foundation, Miami 
University—the latter through funds provided by the Rockefeller Foundation for the study of popula- 
tion distribution. The author is indebted to Philip M. Hauser, Donald J. Bogue, O. Dudley Duncan, 
Beverly Duncan and J. J. Feldman for a careful reading of the paper aud many suggestive comments 
and criticisms. 

1 For a description of the standardization procedure, the assumptions involved, and some of the 
limitations see A. J. Jaffe, Handbook of Statistical Methods for Demographers (Washington: U. S. Govt. 
Printing Office, 1951), Chapter III. For handling cases where a complete schedule of specific rates is 
not available for the particular areas being compared, but a cross-tabulation of the population by the 
variables is available, an alternative procedure termed “indirect standardization” has been developed. 
2 The terms “crude,” “total,” and “unstandardized” are used interchangeably in this paper. 
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between the “observed” crude (or total) rates and the “artificial” 
standardized rates. As yet, very little attention has been directed to 
the problem of formalizing the analysis of standardized rates, and of 
systematically explaining which factors account for the differences be- 
tween standardized rates in comparison with corresponding differences 
between their unstandardized rates. If standardization alters a differ- 
ence between two total rates, it should be possible to measure the 
amount of change, and to break it up into components attributable to 
the various factors for which the data were standardized. Formalizing 
the process of making inferences from standardized data, and establish- 
ing a technique whereby the change accomplished by standardization 
can be interpreted in terms of the factors involved, are the objectives 
of this paper. 

The technique presented here is called “components of a difference 
between two rates.” It is a revision and refinement of a mode of analy- 
sis utilized at the University of Chicago since 1948.* The purpose of the 
technique is to explain the difference between the total rates of two 
groups in terms of differences in their specific rates and differences in 
their composition. Thus, the components framework is broader in 
scope than that of standardized rates, since the framework of the latter 
is designed to summarize and compare differences in two (or more) sets 
of specific rates. 

The basic concept of separation into components has been used in 
research for some time. Implicitly, it has been used whenever the size 
and direction of the difference between standardized rates for two 
populations was compared with the size and direction of the difference 
between their crude rates, and the inference made that the “difference 
between these two differences” is the result of the different composi- 
tions of the two populations. Explicitly, it has been used in research 
under various headings. For example, in the 1920’s Ogburn determined 
what part of the difference between the per cent of the U. S. population 
married in 1890 and the per cent married in 1920 was due to changes in 
the age composition of the population between these two dates.‘ The 
chapter on standardization in Jaffe’s Handbook of Statistical Methods 
for Demographers includes a section titled “Removing the influence of 
changing occurrence rates” which discusses a similar procedure. In his 





3 Earlier statements were prepared by Ralph H. Turner and the writer, with the counsel of Philip 
M. Hauser: Evelyn Kitagawa, “A Method of Analyzing the Influence of Several Non-Quantitative 
Factors on a Result,” (Dept. of Sociology, University of Chicago, March, 1948, hectographed); Ralph 
Turner, “Whites and Negroes in the Labor Force” (unpublished Ph.D. dissertation, University of Chi- 
cago, September, 1948); Turner, “The Expected Cases Method Applied to the Nonwhite Male Labor 
Force, ” American Journal of Sociology, LV (September, 1949), 145-56. 

«EE. R. Groves and W. F. Ogburn, American Marriage and Family Relationships (New York: 
Henry Holt & Co., 1978), 160-2. 
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example, the difference between the proportion of the total population 
5 to 20 years of age attending school in 1940 and the proportion attend- 
ing school in 1910 is, in effect, separated into two parts, one called the 
“influence of changes in age distribution” and the other the “influence 
of changes in occurrence rates.”® A similar procedure is applied to an 
analysis of the 1890-1930 change in the proportion of persons gainfully 
occupied, in an article by Wolfbein and Jaffe.® 

Edwin C. Goldfield’s “method of multiple standardization with allo- 
cation of interactions” is concerned with the same general problem but 
carries the analysis further to include consideration of several factors 
simultaneously, and to evaluate the net influence of each of the factors 
as well as their interaction. This method is used, and very briefly de- 
scribed for particular examples, in Durand’s study of the labor force.’ 

The revised components framework described in the present article 
was first used in a study of labor mobility. Any comparison of this 
definition of components with previous definitions requires a state- 
ment of the rationale underlying the components analysis, as well as 
some standard terminology and algebraic notation. These will be de- 
veloped in the course of presenting the writer’s revised approach, and 
a comparison of the various definitions will be made later. 

As has already been noted, components are closely related to stand- 
ardized rates. Because the method of standardizing rates is familiar to 
research workers, and because components may be defined by sub- 
tracting a standardized rate difference from a crude rate difference, the 
components analysis will be approached by putting standardized rates 
in a components framework. 

The analysis will be developed first with two factors (I and J) con- 
trolled. Formulas for one factor (I) will be given later, and an ab- 
breviated extension to control three or more factors will also be dis- 
cussed. The factors included in the analysis may be either quantitative 
or non-quantitative. 





5 Jaffe, op. cit., pp. 44-6. While Jaffe’s discussion of the problem emphasizes its interpretation as a 
method of “holding constant changes in the occurrence rates,” in contrast to the conventional method 
of computing standardized rates where composition is held constant, there is a clear separation into the 
two parts (or components) mentioned above, in Table 4, p. 46. 

*S. L. Wolfbein and A. J. Jaffe, “Demographic Factors in Labor Force Growth,” American Socio- 
logical Review, XI (August, 1946), 393-6. 

7 John D. Durand, The Labor Force in the United States, 1890-1960 (New York: Social Science Re- 
search Council, 1948), Appendix B. Goldfield’s method is very similar to the components analysis de- 
scribed in the 1948 manuscripts of Kitagawa and Turner. 

8 Evelyn M. Kitagawa, “The Relative Importance—and Independence—of Selected Factors in Job 
Mobility, Six Cities, 1940-49” (Chicago Community Inventory, University of Chicago, hectographed 
report, 1953). 
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CONVENTIONAL STANDARDIZATION IN A COMPONENTS FRAMEWORK 


Suppose we have observed a difference between the crude rates of 
two groups, p and P.® Also, suppose that data for each group are cross- 
classified by two factors, J and J. 

Let 

n;;=number of persons in both the ith category of I and the jth 

category of J in population p 
N,;=number of persons in both the ith category of I and the jth 
category of J in population P 
t;;=rate for persons in the ith category of I and the jth category of 
J in population p 
T;;=rate for persons in the ith category of J and the jth category 
of J in population P 


Also 

n.. and N..=total number of persons in populations p and P, re- 
spectively. 

t.. and 7'..=rate for total persons in populations p and P, respec- 
tively (i.e., crude rates of p and P). 


The I.J-composition of p and P is designated by n,;/n.. and N,;/N.. 
respectively. That is, JJ-composition is a proportionate distribution in 
which the number of persons in each JJ cell has been divided by the 
total number of persons in the group. In convertional summation no- 


tation 
DEM 
é 8 


n.. 


+ don; and N.. 
i 


7 


= 2d 


If we let p represent the group with the higher crude rate, the differ- 


Nij 


and T.. 





Ni; 
pp oe gr by definition. '° 


oe J 





® The term crude rate is used here to refer to the over-all unstandardized rate of any specified group— 
that is, we may be concerned with an analysis of the components of the difference between crude rates 
of two age groups. Although such rates are usually called age-specific rates, in this context they are total 
or crude rates if the two age groups are the population groups with which we are concerned. 

The concept of a rate in the components analysis includes percentages and means. Medians cannot 
be used because they do not have the algebraic properties of means and percentages which are utilized 
in the components method. 

10 Since the crude rate of any population may be regarded as the result of its own JJ-specific rates 


weighted by its own JJ-composition. 
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ence, ¢..—T'.., will be positive. Expressing this difference in the equa- 
tion 
Ni; 


N 
t.. = Ldtss - 2d Ts 
, ‘4 





makes explicit the fact that the difference between the crude rates of p 
and P is due to differences in both their IJ-specific rates and their 
IJ-composition. 

As has already been mentioned, conventional standardization tech- 
niques can be utilized to compute JJ-standardized rates for p and P, 
which will summarize differences in their I.J-specific rates holding con- 
stant differences in their [.J-composition. In the notation outlined 
above, the difference between J.J-standardized rates for p and P may 
be expressed as follows: 

Nij 


x ue — ~ (bis — T;;) if group pis used as standard" 
X rF oe = (ts — T;;) if group P is used as standard 


X pe a ~ (i — T;;) if a third group, p’ is used as standard 


Thus, the difference between the standardized rates of two groups is a 
weighted average of differences in their IJ-specific rates, with the 
IJ-composition of the standard population used as the weights. 

The objective of the components framework is to allocate the differ- 
ence between two crude rates into components which reflect differences 
in specific rates of the two groups, on the one hand, and differences in 
their composition, on the other hand. The equations above suggest that 
the difference between the [.J-standardized rates of two groups might 
be used as the “component due to differences in specific rates,” since 
this difference may be considered a measure of their differences in spe- 
cific rates. Furthermore, if the difference between two standardized 
rates is subtracted from the corresponding difference between two 
crude rates, it is easily demonstrated that the result is a weighted 
average of differences between the composition of the two groups. For 
example, when the difference between the JJ-standardized rates of 
groups p and P, with group p used as the standard, is subtracted from 





U Since 2,2; tij(nij/n..) =t..=both the crude and JJ-standardized rate for p, and since 
2:2; Tij(nij/n..) =the IJ-standardized rate for P, when p: is used as the standard population. 
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the difference between their crude rates, the result is 


(t.. ~ 2 )-LE Sw Pe) 


-DEw-D Ew - Ere +E 


Nis N ‘ . 
= 2 ut oa - + _ =*) = weighted average of differences in 
, ““-_[J-composition of p and P, with the 
IJ-specific rates of P as the weights. 


In this case, the difference between the crude rates of p and P may be 
expressed as the sum of two major components as follows: 


As fas. Ny 
-EE wm + EE (SE -F) 








where 


Nj 
u X - = by — T;;) = component due to differences in I[J- 


specific rates 
= difference between J.J-standardized rates 
(p as standard) 
Ni; 


~ Er.(= oN 





4 “* 


) = component due to differences in JJ- 
composition 


However, these equations show that different standard populations are 
used as weights in the two components. That is, while p may have been 
purposely selected as the standard population to provide weights to 
measure the difference in I.J-standardized rates, the net result—from 
the components framework—is that the other group, P, is the standard 
population which provides weights for summarizing differences in 
IJ-composition of p and P. 

Similar results are obtained if the second population, P, is selected 
as the standard for computing I.J-standardized rates for p and P. Thus, 
it is also true that 


Niz Ni 


og r+ bbw (-54* 
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where 


Xu rF —— (ti — T,;) = component due to difference in I.J-specific 
rates 
= difference between JJ-standardized rates 
(P as standard) 
Ni; Ni; 








= component due to differences in JJ-com- 


EE4(- 


position. 


In this case, when the J/-composition of P is selected to weight differ- 
ences in IJ-specific rates, the IJ-specific rates of p are implied as 
weights for the component due to differences in JJ-composition. 

When a third population, p’, is used as the standard for computing 
I J-standardized rates for p and P, the following components are the 
result: 


(t..-—T..) = p ie —~ (4; — Tis) 








+ eee Seer -7) 


) 


M 

M 

. 
2 
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= component due to differences in [.J-specific 


rates of p and P 


difference between /.J-standardized rates of 


p and P (p’ as standard) 


and 


PEs(-H+EEn (Ef) 


N.. 





= component due to differences in JJ-composition of p and P. 


The last equation makes explicit the weights implied in the “composi- 
tion component” when a third population, p’, is used as the standard 
for computing standardized rates. Specifically, the difference in IJ- 
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composition between p and P is broken into two parts—the difference 


between p and p’ or 
(= ~) 
a... Ws. 


and the difference between p’ and P or 


oe 

n’.. N..]’ 

and IJ-specific rates of p are implied as weights for the first part, while 
[J-specific rates of P are the weights for the second part. 

Thus, the statement that a population, p’, is used as a standard 
population for computing IJ-standardized rates for p and P means 
specifically that the IJ-composition of p’ is used to weight differences 
in IJ-specific rates of p and P. If, in interpreting the results of stand- 
ardization, the difference between the standardized rate difference and 
the crude rate difference is attributed to the different composition of 
the two groups, it should be recognized that the I.J-specific rates of the 
two groups themselves are the weights, or standard, for summarizing 
differences in their /.J-compositions. 


MAJOR COMPONENTS (TWO FACTORS, J AND J) 


The components analysis starts directly and explicitly from the per- 
spective of allocating a crude rate difference into parts attributable to 
differences in composition and specific rates. Suppose the crude rates 
refer to the two groups, p and P. Also, suppose that their specific rates 
and composition are classified by two factors, J and J. Then, an un- 
ambiguous allocation of the crude rate difference into two major com- 
ponents—-one reflecting differences in JJ-composition only, and the 
other differences in I.J-specific rates only—will be obtained if two sets 
of weights, w,; and w,;’, are selected which satisfy the equation 


wo ONG 
..—-T..= > > wy (~ = )+z pa w'is(tij — Tei). 


n.. ON. 





The first component on the right side of the equation represents a 
weighted sum of differences in JJ-composition, and the second com- 
ponent is a weighted sum of differences in IJ-specific rates. These com- 
ponents will be called “Combined JJ” and “Residual IJ,” respec- 
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tively.!2 A meaningful interpretation of the weights is obtained if the 
w,; are a set of IJ-specific rates, and the w’,; define an IJ-composition, 
In this case, the crude rate difference (which is due to differences in 
both IJ-composition and IJ-specific rates) is expressed as the sum of 
two components: (1) Combined I/, or differences between the I.J-com- 
position of p and P, with IJ-specific rates held constant; and (2) Re- 
sidual IJ, or differences between the I.J-specific rates of p and P, with 
IJ-composition held constant. 

Thus, the chief distinction between the components perspective and 
that of conventional standardized rates is in the specification of a 
standard population. In the components framework, the standard 
population must include a set of IJ-specific rates to be used as weights 
for the composition component, as well as an J.J-composition to be used 
as weights for the “specific rates” component. 

The results of the preceding section indicate that the IJ-specific rates 
and the JJ-composition of the same population will not satisfy the re- 
quirements of a standard population for the components framework if 
the objective is an unambiguous two-component solution.” For exam- 
ple, when the JJ-composition of p was used as the standard for com- 
puting standardized rates, the JJ-specific rates of P were implied as 
the weights for the component due to differences in composition. There- 
fore the three sets of components discussed in the preceding section are 
excluded as possible two-component solutions unless the standard 
population is conceived as a population with the JJ-composition of one 
group and the IJ-specific rates of another group; for example, the 
composition of p and the specific rates of P. Although for particular 
problems a researcher may be willing to define such a standard popula- 
tion, the two-component solution proposed below would seem to be 
more generally useful from the components perspective. 

Since the standard population for a two-component solution inevita- 
bly involves some characteristic of both p and P, the possibility of using 
the average composition and the average specific rates of these two 
groups to define the standard population is suggested. In our notation 





12 The term “Residual JJ” is assigned to the component due to differences in JJ-specific rates, since 
it measures the difference between the total rates of the two groups after IJ-composition is held constant, 
while the crude rate difference measures the difference between total rates of the two groups without 
taking into account differences in JJ-composition. 

13 This statement is, of course, limited to the three populations discussed in the previous section as 
possible standards for computing standardized rates—namely, p, P or a third “observed” population, p’. 
We have not yet discussed the use of certain “hypothetical” populations. Also, we have thus far required 
a set of components to allocate the crude rate difference into two parts only, one due to differences in 
specific rates and the other to differences in composition. 
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# = component due to differences in /.J-specific rates (with 
d average JJ-composition of p and P as weights) 

e = difference between JJ-standardized rates of p and P 
: (with average JJ-composition of p and P as the 
“ standard) 

, Thus, a standard population having the average IJ-cc aposition and 


the average I.J-specific rates of p and P does allocate the difference be- 
P tween their crude rates into two major components, one reflecting dif- 
ferences in their IJ-composition and the other differences in their 
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, I J-specific rates.“ 
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4 It may be noted that the use of a weighted average will not yield the symmetric results of the 
simple average. For example, if the specific rates of p are assigned a weight of 2 and those of P a weight 
of 1 to obtain weighted average specific rates as weights for the “composition component,” then in the 

J weights for the “rates component” the composition of P will have a weight of 2 and the composition of 
| p a weight of 1. That is 
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This solution is proposed here as the most meaningful one for genera] 
purposes when the components framework is used. However, an al- 
ternative three-component solution may be considered, especially for 
certain types of comparisons. For example, if the two crude rates refer 
to the same population at two different dates, the following questions 
might be asked: (1) How much change would there have been in the 
crude rate between the two dates if the JJ-composition of the popula- 
tion changed as it did but the IJ-specific rates had remained constant 
(as of the earlier date)? (2) How much change would there have been 
in the crude rate if the IJ-specific rates changed as they did, but the 
IJ-composition had remained constant (as of the earlier date)? (3) If 
the changes measured in (1) and (2) do not add to the total change in 
the crude rate, by how much do they fail to do so? If we let p represent 
the population at the later date, and P the population at the earlier 
date, the total change in the crude rate between the two dates may be 
expressed as the sum of three components, as follows: 


Neaj Nij Ni; 
t..—T..= rrr (-— ) + } pM ny. ta Tis) 


nN.. 


ss 
+ DD tu - ta) (~ -—). 


n:. WN. 





The first (or composition) cemponent measures changes in JJ-compo- 
sition assuming no change :i: «./-specific rates; the second (or rates) 
component measures changes in I.J-specific rates assuming no change 
in IJ-composition; and the third (or interaction) component involves 
changes in both IJ-composition and I.J-specific rates. 

An exchange of comments with several readers of this paper revealed 
some differences in preference for the two- and three-component solu- 
tions, particularly when the two crude rates refer to the same popula- 
tion at different dates (or to any comparison where one set of events 
may be considered to precede another). In such a comparison, it may 
be argued, a three-component solution, with the initial population pro- 
viding the weights for both the “rates” and “composition” components, 
appears to be a logical approach. The interpretation of components in 
this case is described above. 

On the other hand, it is the writer’s opinion that a good case can be 





4 It may be noted that these three components reduce to a two-component solution in either of two 
ways. If the first and third components are added, the result is a composition component with the 
specific rates of p as weights. If the second and third components are added, the result is a rates com- 
ponent with the composition of p as weights. 
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made for the two-component solution in such situations. A brief dis- 
cussion of the reasons for this preference may clarify the rationale of 
both solutions. First, the selection of a standard population for a com- 
ponents analysis of the change in a crude rate between two dates can 
be made from any of several assumptions. The three-component solu- 
tion derives a rates component assuming no change in composition, a 
composition component assuming no change in specific rates, and an 
interaction component reflecting changes in both rates and composi- 
tion. However, changes in rates and composition are seldom independ- 
ent—rather, a change in one is likely to affect the other. It may be 
argued, therefore, that since buth were changing during the period, a 
logical set of weights for summarizing changes in specific rates, for 
éxample, would be the average composition of the population during 
the period; similarly, the weights for the composition component might 
be the average specific rates experienced by the population throughout 
the period. The two-component solution uses averages for the two 
dates as weights for these components. While such averages are not 
equivalent to averages for regular intervals throughout the period, they 
are often the only averages which can be obtained; also, even when data 
are available, the computation of annual averages, or averages for any 
other regular interval, would be a very laborious task. It may be noted 
in this connection that the sir ple average for the beginning and end of 
the period will equal the annual (or other regular interval) average if it 
is assumed that changes were distributed uniformly throughout the 
period. 

Second, the selection of a standard population for a components 
analysis of crude rates for two dates is in many respects comparable to 
the problem of selecting weights for index numbers. Economists give 
careful consideration to alternative sets of weights when computing 
index numbers for two dates, and an average of appropriate values for 
the two years is frequently used.'* For example, to compute an index of 
the physical volume of manufacturing production for 1947 relative to 
1939, the Bureau of the Census defined change in physical volume as 
the “change in value of net ouput, or value added [by manufacture], at 
constant prices,” and used the average prices for the two years as 
weights to be applied to the quantities of each product included in the 
index.!? An advantage of the use of average prices, as compared with 
prices for the first year, is that it avoids overweighting products whose 





18 The average used in Fisher’s “ideal” index is the geometric mean, while the simple arithmetic 


mean is used in the Marshall-Edgeworth formula. 
17 Bureau of the Census, Census of Manufactures: 1947, Indexes of Production, pp. 2-4. 











{ 


1180 AMERICAN STATISTICAL ASSOCIATION JOURNAL, DECEMBER p35 


prices have shown a considerable “relative” decrease between the two 
years, and also avoids underweighting products whose prices have 
shown a considerable “relative” increase during the period. 

In the general case—when the crude rates refer to two different 
groups, neither of which may be considered to precede the other—there 
is no “initial state” from which to measure change and, therefore, less 
emphasis on the desirability of using the rates and composition of one 
group as weights for both components. An unambiguous allocation into 
two components, using the average rates and average composition as 
weights, is a possible solution, and appears more logical than the various 
alternatives considered.'* 


SUBCOMPONENTS OF COMBINED IJ 


Combined IJ, the composition component of the two-component 
solution, may be further divided into three subcomponents as follows: 
Combined IJ = Net I;+Net J;+Joint IJ where 
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18 If it is desired, a range of variation for each component, as a result of the different weights which 
might be used in its computation, might be estimated by computing each component twice—once with 
one of the two populations providing the weights, and a second time with the other population pro- 
viding the weights. Thus, the two values for each component defined by the two equations 


ng | ONG Nj 
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might be used to estimate the range of variation for each component. It may be noted that each major 


component in the two-component solution proposed in this paper is an average of the two values ob- 
tained for this component from these equations. 
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In these formulas,?° n;, and N;, represent the total number of persons 
in the ith category of factor J in groups p and P, respectively; and n.; 
and N.; represent the total number of persons in the jth category of 
factor J in p and P, respectively. 

Net I, is a weighted sum of differences in Jy-composition (J-compo- 
sition within the J subgroups)” of p and P and, therefore, may be in- 
terpreted as that part of the difference between their crude rates which 
is attributable to differences in net Jy-composition, or J-composition 
independent of J. The average IJ-specific rates and average gross 
J-composition of p and P are used as weights in this subcomponent.” 
That is, Net J; measures differences in Iy-composition applied to a 
standard population which has the average IJ-specific rates and the 
average gross J-composition of p and P. 

Similarly Net J; is a weighted sum of differences in J;-composition 
(J-composition within the J subgroups) and may be interpreted as that 
part of the crude rate difference which is due to differences in net 
J-composition, or J-composition independent of I. The average IJ- 
specific rates and average gross J-composition of p and P are used as 
weights for this subcomponent. That is, Net J; measures differences in 
J;-composition applied to a standard population which has the average 
IJ-specific rates and the average gross I-composition of p and P. 

Joint IJ is that part of the Combined IJ component which cannot be 
allocated to differences in net J;-composition or to differences in net 
J-composition. It represents the part of the crude rate difference which 
is accounted for by differences in combined IJ-composition but which 
cannot be allocated independently to J or J. Its equation shows it to be 
a weighted sum of differences in IJ-composition which result from 
combining the net composition of one of the two populations with the 
gross composition of the other,” using the average IJ-specific rates of 
the two populations as weights. 





19 Because the composition component of the three-component solution is not a total measure of 
differences in IJ-composition—part of the composition differences are included in the interaction com- 
ponent—there is no discussion of its subcomponents. 

20 Note that nj. = Lj nij, n.7 = Zi nij, ete. 

21 In the equation for this subcomponent, (njj/n.j —Nij/N.j) represents the differences in J-compo- 
sition within the subgroups of factor J. 

22 The weights are enclosed in brackets, [ ], in the equations defining the subcomponents. Gross 
J-composition refers to the per cent distribution of a group when it is classified by factor J only. For 
example, in group p gross J-composition is defined by n.;/n... 

% Specifically, the term 


expresses the difference between (1) the net J7-composition of P combined with the gross I-composition 
of p, and (2) the net J;-composition of p combined with the gross I-composition of P. Similarly, the 
term 
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Thus, a complete components analysis allocates the difference be- 
tween the crude rates of p and P into four parts: 


t.. — T.. = Net J, + Net J; + Joint JJ + Residual JJ 


where Net J, is the part due to differences in J-composition inde- 
pendent of J; Net J; is due to differences in J-composition independent 
of I; Joint IJ is due to differences in Joint IJ composition, or to differ- 
ences in Combined JJ composition which cannot be allocated inde- 
pendently to J or J; and Residual IJ is due to differences in IJ-specific 
rates of p and P. The standard population used for this purpose is one 
having the average I./-composition and the average IJ-specific rates 
of p and P. 


ONE FACTOR COMPONENTS 


If data for the two groups, p and P, are cross-classified by only one 
factor, I, a two-component allocation of the difference between their 
crude rates may be defined as follows: 


t. — T. = Gross J + Residual J 


t+ 7; ni N; 
Gut « To Merk 
wel = Bee) 


where 





n. . 
> a tara (t; — Ti). 

The Gross J component represents that part of the difference be- 
tween the crude rates of p and P which is due to differences in their 
I-composition, and Residual J the part due to differences in their 
I-specific rates. Residual J is also equal to the difference between the 


Residual J 





expresses the difference between (1) the net y-composition of P combined with the gross J-composition 
of p, and (2) the net Iy-composition of p combined with the gross J-composition of P. These two sets of 
differences—one for the combination of net Jy and gross J compositions, the other for the combination 
of net J; and gross I compositions—are obtained because the two factors J and J, are interchangeable 
in this subcomponent. Therefore, corresponding differences (that is, differences for the same IJ cell of 
the cross-classification) in these two sets are averaged, and the resulting set of average differences are 
weighted by average IJ-specific rates of p and P. This interpretation of the Joint JJ subcomponent 
is presented to state explicitly the differences which are measured. It is not necessary to compute these 
differences to obtain this subcomponent since it may be obtained more simply by subtracting the sum of 
subcomponents Net Iy and Net J; from the major component Combined IJ. 
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]-standardized rates of p and P, with the average J-composition of p 
and P as the standard population. 

A three-component allocation similar to that described for two fac- 
tors, is as follows: 


Ny N; 
1-7. > 7(=- 
X n. N. 





Ni 
)+z yw &— ft 





. Ny N t 
+ (t; = T;) (~ - ). 
X n. ON. 
In this case, the J-specific rates and I-composition of group P are used 
as weights for the “composition” and “rates” components, respectively. 
The third component is an “interaction” component which is due to 
differences in both composition and rates. 


INTERPRETATION OF RESULTS 


We shall first consider the two-component solution. The interpreta- 
tion of the two major components and the subcomponents is relatively 
simple when all the components are positive, since the difference in 
crude rates can then be attributed partly to differences in I.J-specific 
rates and partly to differences in I.J-composition, [;-composition, Jr- 
composition and Joint IJ-composition. In this situation, the com- 
ponents may be converted to per cent components, with the crude rate 
difference as the base. For example, 


Combined IJ 
t.. —T 





X 100 = per cent of difference between crude 
rates which is due to differences in 


IJ-composition of p and P. 


Even in this case, however, two qualifications should be kept in mind: 
(1) the components include the effects of hidden forces behind the 
factors, J and J, and nothing in the components technique justifies 
inferences as to causal relationships—such inferences must be based 
on knowledge outside the statistical technique itself; (2) a small Re- 
sidual component may mask larger influences, in opposite directions, 
of factors not held constant in the analysis. The latter qualification 
means, for example, that Residual IJ may be less than Residual IJK; 
that is, the difference in J.J K-standardized rates may be greater than 
the difference in IJ-standardized rates. Thus, the results of a com- 
ponents analysis should be interpreted as “applicable within the frame- 
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work of the particular factors held constant.” For example, a Residual 
IJ per cent component of 35 per cent may correctly be interpreted as 
indicating that “the difference in JJ-standardized rates is only 35 per 
cent as great as the difference in crude rates,” and the corresponding 
Combined IJ component of 65 per cent may be interpreted as indicat- 
ing that “65 per cent of the crude difference is attributable to IJ-com- 
position.” But, as additional factors (K, L, etc.) are held constant, the 
Residual component may not always decrease (and the Combined 
component increase) with the addition of each new factor—in fact, it 
will not do so unless all the other factors (K, L, etc.) operate in the 
same direction. This characteristic simply points to the fact that the 
difference between two crude rates is not the equivalent of a concept 
like total variance of a dependent variable in regression analysis, for 
example, which will be increasingly “explained” as more independent 
variables are added to the regression equation. 

When not all of the components are positive, their interpretation is 
more complicated. For example, if Residual JJ—the difference between 
IJ-standardized rates—is greater than the difference between the 
crude rates, Combined JJ is negative. If all of the subcomponents have 
the same sign (that is, are negative), Residual JJ might be used as the 
base for per cent components, which can be interpreted as follows: 


. . ar: 
Residual IJ 





X 100 = per cent of the difference between /.J-stand- 
ardized rates which is observed or evident in 


the difference between their crude rates. 








— Combined IJ 
. X 100 = per cent of the difference between J.J-stand- 
Residual IJ . — ; 
ardized rates which is obscured (in the crude 
rates) by differences in JJ-composition. 
(This is a positive component because Com- 
bined IJ is negative.) 
— Net I J : 
- X 100 = per cent of the difference between J.J-stand- 
Residual IJ 


ardized rates obscured by differences in J,- 
composition, etc. 


Other combinations of positive and negative components do not 


% Strictly speaking, the addition of a new independent factor need not increase the amount of 
“explained variance,” but it cannot decrease it. 
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readily lend themselves to meaningful per cent components. In such 
situations, the components themselves may be used without conversion 
to per cents. For example, if t..—7..=2, Residual IJ = —25, and Com- 
bined IJ =27, we might simply say that while the crude rate of p 
exceeded that cf P by 2 points, the standardized rate of P exceeded that 
of p by 25 points, and that differences in JJ-composition were re- 
sponsible for these widely different results. The Net I, Net Jr, and 
Joint IJ subcomponents may be used to indicate the importance of 
j-composition independent of J, J-composition independent of I, and 
Joint I.J-composition in obscuring (in the crude rates) the difference 
between standardized rates. 

If a crude rate difference is allocated into three major components, 
and all are positive, each component might be expressed as a per cent of 
the crude rate difference. Or, if this solution is used in a comparison over 
time, it may be desired to use the crude rate of the earlier date as the 
per cent base. That is, we may write these components as 


‘ 4 Ni 
ee ee > 1a (~4 -=*) + EE <te 
; Nn.. - N.. 
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nN. 


+ u X (tig — Tis) (2 - x). 


If each of the terms in this equation is expressed as a per cent of 7’.., 
the crude rate at the earlier date, then the per cent which ¢.. is of T’.. is 
equal to 100 plus the three percentage components. In this case, for 


example, 


< 100 = per cent change in the total rate between 


the two dates as a result of changes in 





composition, assuming no change in 
specific rates during the period. 


AN EXAMPLE OF RESULTS 


In a study of labor mobility in six cities made in 1951,% this method 
was used to determine the extent to which city differences in job mo- 





% This research, conducted by the Chicago Community Inventory of the University of Chicago, 
was based on data obtained in the Six-City Mobility Study, one of the industrial manpower research 
studies sponsored by the United States Air Force under Project SCOOP. Findings are summarized in 
Evelyn M. Kitagawa, “Relative Importance and Independence of Selected Factors in Job Mobility, 


Six Cities, 1940-49,” op. cit. 
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bility rates were due to differences in the composition of the labor force 
in the various cities. To cite one particular example, the crude mobility 
rate (mean number of jobs held, 1940-49) of Los Angeles men was 32 
per cent higher than that of Philadelphia men, or an average of 3.14 
jobs as compared with 2.37 in Philadelphia. 

Table 1 presents data on mobility rates and composition, by migrant 
status and time spent in the labor ferce from 1940 to 1949, for men in 
these two cities. In the lower half of the table are the results of a com- 
ponents analysis of the difference between crude mobility rates of Logs 
Angeles and Philadelphia men with these two factors held constant— 
migrant status (J) and time spent in labor force (J). 

Examination of the specific mobility rates shows that Los Angeles 
men were consistently more mobile than Philadelphia men, even when 
data are cross-classified by migrant status and time in the labor force. 
However, it is also clear that migrants were more mobile than non- 
migrants, and persons in the labor force 5-9} years were more mobile 
than persons in for less or more time. Furthermore, the percentage dis- 
tributions which describe the composition of men in the two cities indi- 
cate higher proportions of Los Angeles men in the high mobility cate- 
gories—for example, 47 per cent of the Los Angeles men were migrants 
as compared with 13 per cent of the Philadelphia men. Thus, we expect 
that part, but not all, of the difference between crude rates for men in 
these two cities is due to differences in their composition with respect 
to migrant status and time spent in the labor force. The components 
analysis quantifies this relationship. 

Differences in composition with respect to both migrant status and 
time spent in the labor force account for 47 per cent of the difference 
between crude rates of Los Angeles and Philadelphia men. And, the 
difference between J.J-standardized rates for men in these two cities is 
53 per cent of their crude rate difference, using their average IJ- 
composition as standard (Residual JJ). 

Furthermore, migrant composition alone, independent of time spent 
in the labor force, accounted for 38 per cent of the crude rate difference, 
with only 1 per cent due to composition by time spent in the labor 
force independent of migrant composition, and 7 per cent to these two 
factors jointly. 

Similar components were computed for selected pairs of cities, with 
these and other factors held constant, to determine which factors were 
most important in accounting for city differentials in crude mobility 
rates and to what extent these city differentials reflected differences in 
specific (or standardized) mobility rates. Space does not permit a more 
detailed analysis, but the purpose here is only to illustrate the use of 
the method in a specific set of data. 
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Measures of the sampling variability of per cent components have 
not been determined. Since each component is based cn the difference 
between two sums of a large number of products, its sampling variance 
may prove to be too unwieldy to estimate, though further study of this 
problem might yield some simplifying assumptions which will furnish 
approximate estimates of sampling variance without overburdening 
computations.** With census data based on complete counts or with 
very large samples where cross-classifications do not run thin, per cent 
components should be relatively stable or reliable. But with small 
samples or with smail cell frequencies in complete counts, inferences 
should be made with caution. 


COMPARISON WITH PREVIOUS DEFINITIONS OF COMPONENTS 


Reference was made in the introductory paragraphs to previous work 
involving the allocation of a crude rate difference into components. The 
set of components proposed in this paper differs from those used previ- 
ously in two respects. First, the earlier approaches selected one of the 
two populations being compared to provide weights for one of the two 
major components, considered this population the standard population 
for the components analysis, and did not make explicit the weights used 
in the other major component which was obtained as a residual (by 
subtracting the computed component from the crude rate difference, 
or an equivalent procedure). The algebraic presentation in this paper 
has made explicit the set of weights which was implicit in such a two- 
component solution; for example, when the IJ-specific rates of one 
group are used to weight differences in J.J/-composition, the IJ-compo- 
sition of the other group is used to weight differences in IJ-specific 
rates. The two-component solution proposed here uses a standard 
population having the average J.J-composition and the average I.J-spe- 
cific rates of the two groups. 

Second, the rationale of a set of subcomponents in this paper is more 
defensible than that in the previous literature.*” This can best be seen 
by noting that earlier definitions of the Joint JJ subcomponent, if 





% One place in the analysis where tests of significance can be made is in the determination of 
whether one population’s specific rates are on the whole larger than another's. The test consists of con- 
sidering each sub-group in one population with its comparable sub-group in the other population as a 
four-fold table. One can then sum the chi-squares from the individual four-fold tables. (See Karl Pearson 
and J. F. Tocher, “On Criteria for the Existence of Differential Death Rates,” Biometrika, 11 (1916), 
159-64; S. A. Stouffer and Clark Tibbitts, “Tests of Significance in Applying Westergaard’s Method of 
Expected Cases to Sociological Data,” Journal of the American Statistical Association, 28 (1933), 293- 
302; H. F. Dorn and 8. A. Stouffer, “Criteria of Differential Mortality,” Journal of the American Sta- 
tistical Association, 28 (1933), 402-13). 

27 To the writer’s knowledge, the only previous work involving subcomponents is contained in the 
cited reference to Goldfield’s method and the Turner and Kitagawa references. The comments here are 
applicable, for the most part, to the Turner and Kitagawa definitions; although Goldfield’s definition of 
a Joint subcomponent was similar, he did not retain it as a measure of Joint composition but allocated 
it back to the factors involved. 
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TABLE 1 


JOB MOBILITY RATES (MEAN NUMBER OF JOBS HELD 1940-49) 

- AND COMPOSITION (PERCENTAGE DISTRIBUTION) BY MIGRANT 

STATUS AND TIME SPENT IN THE LABOR FORCE, FOR Los 
ANGELES AND PHILADELPHIA MEN: 1940-49* 














o an Difference 
Migrant Status Mobility Rates | Composition (%) a ee 
and Time in Phila.) 
se Los | Phila- | Los | Phila- Ai 
Angeles | delphia | Angeles | delphia | Rates siti ~ 
All Men 3.14 2.37 100 100 Ay ej 0 
Less than 5 yrs. 2.90 2.42 11 7 .48 4 
5 but less than 94 yrs. 3.82 3.26 30 26 .56 4 
94-10 yrs. 2.84 2.03 59 67 81 —8 
Migrants 3.77 3.13 47 13 .64 34 
Less than 5 yrs. 2.89 2.29 6 1 .60 5 
5 but less than 9} yrs. 4.07 3.43 17 4 .64 13 
93-10 yrs. 3.79 3.15 24 8 .64 16 
Non-migrants 2.58 2.26 53’ 87 .32 —34 
Less than 5 yrs. 2.92 2.45 5 6 47 —] 
5 but less than 9} yrs. 3.49 3.23 13 22 .26 —9 
93-10 yrs. 2.20 1.88 35 59 .32 —24 























Components of the Difference Between the Crude Job Mobility Rates of 
Los Angeles Men (8.14) and San Francisco Men (2.37) 
(Two Factors: J =time spent in labor force 1940-49; J = migrant status) 











. Per Cent Actual 
Name of 
Component Component Component 
(t..—T’.. =100) (t.. —T’.. =.77) 
Combined JJ 47 .359 
Net J; 38 .296 
Net J, 1 .008 
Joint IJ 7 .054 
Residual JJ 53 .411 





* Data refer to a probability sample of 1,313 men in Los Angeles and 1,571 men in Philadelphia 
who worked one month or more in 1950. Persons residing in each city in 1951 who had resided there 11 
years or less (that is, who moved there after the beginning of the 1940-49 decade) were classified as 
migrants for purposes of this study. 
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translated into the algebraic notation, do not reduce to a weighted sum 
of differences in composition resulting from a combination of the net 
composition of one group with the gross composition of the other. In 
previous definitions the Net J; and Net J; subcomponents were defined 
independently, and then the Joint JJ subcomponent was computed by 
subtracting Net Jy and Net J; from Combined IJ, without determin- 
ing, algebraically, what such a residual measured. For example, when 
the specific rates of P were used as weights for the Combined IJ com- 
ponent in previous definitions, the subcomponents were defined as fol- 
lows, if translated into our notation: 


e ij Ni; 
Net Ty = > ¥) Ty 4 (= — =) 


n.j N.; 


Nj. Wis ce se N.,; N.. 





Ni ni. N; ij Nu n. 
Joint Y= LD Tal jn j oa j ~~) 


However, if the rationale of a set of subcomponents in this paper is 
used to compute subcomponents for the same Combined JJ component 


ij Nii\* 
(Combined 7 So KE — —]), 
e & n.. } oe 


the results could be 
either 





i j N.. j N.; 
. Nix. Nij Nij 
Net Jr = ey X ty (= —_ az) 


Joint IJ = I Dra Site & 





°8 Two similar sets of subcomponents could be obtained if the rates of p (instead of P) were used as 
weights in the Combined IJ component; equations for these two sets can be written by interchanging 
the roles of p and P in the weights of the equations above. Thus, four sets of subcomponents might be 
defined for the two Combined IJ components. Averaging the four values for each subcomponent would 
give a single set of subcomponents which is identical to the set defined in this paper. 
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or 
N.; Nij Nij 
Net = DY Tw “(4 - —*) 
ee. 2 |e N.j N.; 
ni, [ Nj N ij 
Net Jr = DD Ty (“4 - 
Y i oo NN. Ni. 


Ni; i. ij N.; 
Joint IJ = > >> tu(— ac peage ). 


i j 
COMPARISON WITH WESTERGAARD’S METHOD OF EXPECTED CASES 


The logic and mechanics of Westergaard’s method of expected cases 
is described by Woodbury in a study of infant mortality rates.?® Jaffe 
describes the method as “essentially an elaboration of the conventional 
standardization technique which can be used in a situation where the 
investigator wishes to isolate the influence of a single factor from that 
of other associated factors.”*° 

Basically, the method involves the computing of ratios of the actual 
number of cases in a particular group to the expected number of cases 
in the same group should it retain its own composition while being 
exposed to the specific rates of a standard population (with the total of 
the various subgroups usually used as standard). For example, Wood- 
bury in the study cited above, uses the method to isolate the influence 
of order of birth on infant mortality holding constant age of mother and 
earnings of father. If we let factor IJ represent age of mother and factor 
J earnings of father, then in our notation we might say 


n;;™ =number of births in both the ith category of J and the jth 
category of J in the population of k-order births 
N,;=number of births in both the ith category of I and the jth 
category of J in the population of total births 
t;;™) =infant mortality rates for births in the ith category of I and 
the jth category of J in the population of k-order births 
T ;;=infant mortality rates for births in the ith category of I and 
the jth category of J in the population oi total births 


His results are summarized in a table similar to Table 2. A comparison 
of the Westergaard ratios in the second column is used to indicate the 
extent of the variation in infant mortality by order of birth with the 





29 R. M. Woodbury, “Westergaard’s Method of Expected Deaths as Applied to the Study of Infant 
Mortality,” Journal of American Statistical Association, 18 (1923), 366-76, and reproduced in Jaffe, op. 
cit., Chap. III. 

30 Jaffe, op. cit., p. 48. 
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influence of factors J and J eliminated. Also, comparison of the ratios 
in the first and second columns (the former measure crude or unad- 
justed birth-order differentials in infant mortality) indicates the in- 
fluence of factors J and J—that is, IJ-composition—on birth-order 
differentials in infant mortality rates. However, the Westergaard tech- 
nique does not quantify this aspect of the analysis—that is, it does not 
measure what part of the crude birth-order differentials is attributable 
to factors I and J. 

If we compare the Westergaard framework with conventional stand- 
ardization and the components analysis, several relationships become 
evident. First, the components framework applied to the analysis of 
birth-order differentials in mortality might be used to analyze the dif- 
ference between crude infant mortality rates of any two birth orders 
into components due to differences in JJ-composition on one hand and 
to differences in I.J-specific rates on the other hand. That is to say, it 
quantifies a relationship which is apparent in a Westergaard analysis 
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but not rigorously measured in terms of the “components” concept 
defined in this paper. 

To the writer’s knowledge, there is no published work comparing the 
Westergaard technique with conventionally standardized rates se- 
lected to accomplish the same purpose, although the technique has 
always been recognized as a “standardization” method. Examination 
of the formulas for Westergaard’s ratios in the table above reveals that 
these ratios are actually equivalent to ratios of indirectly standardized 
infant mortality rates for each birth order (with the population of total 
births as standard) to the crude infant mortality rate for total births 
(i.e., the standard population). For example, the indirectly standard- 
ized rate for k-order births, with total births as the standard popula- 
tion, is given by* 


actual infant deaths to k-order births ‘Pie death rate ") 
expected infant deaths to k-order births 





standard population 


or 


De De bP ny 
: a . 
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3 





And if this is divided by the crude rate (7'..) for total births we get the 
Westergaard ratio.® 

Thus, if the objective is to compare the relative incidence of mor- 
tality from one birth order group to the next, holding constant the 
disproportionate weighting of the various groups by age of mother and 
earnings of father, the Westergaard ratios might be considered as ap- 
proximations in the same sense that indirect standardization approxi- 
mates the results of direct standardization. Ratios based on directly 
standardized rates,** with the population of total births as standard, 
would be defined 
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3 Jaffe, op. cit., pp. 44-8. 

32 The Westergaard ratio for a category may also be described as the correction factor which is 
applied to the crude rate of the standard population to obtain the indirectly standardized rate for that 
category. 

33 Such ratios have been used in at least one study—Donald J. Bogue, A Methodological Study of 
Migration and Labor Mobility in Michigan and Ohio in 1947 (Oxford: Scripps Foundation for Research 
in Population Problems, 1952), p. 64. 
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The only factor which varies in these ratios for different birth orders 
are the IJ-specific rates of the birth orders, while in the Westergaard 
ratios the composition which appears in the numerator and denomi- 
nator of each ratio also varies from one birth order to the next. That is 
to say, Westergaard ratios do not strictly speaking, hold IJ-compo- 
sition constant from one birth order to the next. However, there may 
be good reasons for computing ratios of indirectly standardized rates 
such as the Westergaard ratios.*4 


EXTENSION OF FRAMEWORK TO THREE VARIABLES 


Extending the components framework to three variables (J, J and K) 
is simple for the major components—Combined IJK and Residual 
IJK. In this case 


t... — T... = Combined //K + Residual IJK 


where 
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However, expressions for net and joint subcomponents would be 
quite complex if the two-factor model were extended to three factors. 
A much simpler solution results if one is willing to consider two of the 
three factors as a single factor and apply the two-factor framework. For 
example, if we are primarily interested in the influence of J independent 
of both J and K, we might consider J as one factor and the cross- 
classification of J by K as a second factor—the latter will be denoted 
by (JK). Then, in addition to the combined and residual components 
defined above, we could define the following subcomponents: 
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3! Two of the most obvious are: (1) when IJ-specific rates are available only for total births and not 
for each birth order, and (2) when enough of the JJ-specific rates for the various birth orders are based 


on too small numbers for stability. See Peter Cox, Demography (London: Cambridge University Press, 
1950), Chapter 7, for a statement of weights implicit in indirect standardization. 
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In this situation, Net Iyx represents the part of the difference be- 
tween crude rates attributable to differences in J-composition inde- 
pendent of both J and K, while Net (JK); measures the part attributa- 
ble to combined J K-composition independent of J. 

Such an analysis could be made with any pair of the factors con- 
sidered as a single factor. Also, if four or more factors are combined in 
any way to reduce to two, a similar approach could be used. 





RECENT DEVELOPMENTS IN THE SAMPLING OF 
HUMAN POPULATIONS IN GREAT BRITAIN* 


C. A. Moser 
Division of Research Techniques, London School of Economics 


This article describes the main developments in the sam- 
pling of human populations in Great Britain during the last 
five years. Changes in methodology, as well as new applica- 
tions, are included. 


PREVIOUS article, published in this Journal in 1949 [44], described 
A the main British activities in the field of sample surveys of human 
populations at that time. Since then the picture has changed in various 
respects and it seems worth while now to bring the earlier account up 
to date. The purposes of this article are to describe the major changes 
that have taken place in sampling practice, to mention some new appli- 
cations and to give a guide to the somewhat diffuse literature. Once 
again, only surveys concerned broadly with human populations’ are 
discussed ; emphasis throughout is on methods rather than results. 


DEVELOPMENTS IN METHODOLOGY 


Developments in the methods used can be grouped under four heads: 
sampling frames; types of sample design; refinements in random 
(probability) sample designs; and research on methodology. 

Sampling Frames. A determining feature of sampling practice any- 
where is the nature of the sampling frames available. In the U.K., 
random samples for most large-scale surveys are selected from one or 
another of the national population lists which are at hand. Their 
existence largely explains why area sampling has not been used here. 

In 1949 three lists were mentioned as being particularly useful to 
samplers: the National Register, the Rating List and the Register of 
Electors. The general view about their relative suitability for sampling 
was expressed by Gray and Corlett [24]: “ . . . we would advocate the 
use of the Rating Records for samples of households or housewives and 
the National Register for samples of individuals, or particular age- 
and sex-groups, of demobilized persons, and also for surveys of the 
town-planning type”. The paper referred to gives a full account of the 
various frames. 





* IT should like to express my thanks to statisticians and other experts in Government departments 
and elsewhere and to colleagues at the London School of Economics for their comments on a draft of 
this paper. The paper in its final form is entirely my responsibility and the comments made in no way 
commit the experts who kindly gave me advice. 

1 One or two surveys of economic institutions, e.g , firms, are included. 
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The National Register, however, was only available to official 
bodies so that most of the organizations in the sample survey field 
could not use it even if they wanted to. This, among other things, led 
the Social Survey to conduct an inquiry into the main alternative, the 
Register of Electors, and a report on the findings was published by 
Gray, et al., [25]. In some respects this frame always has been attrac- 
tive to the sampler. It is available centrally and locally, its order and 
arrangement are convenient, and the individuals on it are numbered 
serially, thus facilitating quasi-random selection, The misgivings 
about the Register had been that not everyone entitled to be on it 
would in fact appear on it and that, since it was revised only once a 
year, it was always somewhat out of date. More precisely, since the 
Register is published 4 months after the so-called “qualifying date” 
for registration, it is that much out of date when it appears and 16 
months out of date by the time it is replaced. 

The results of the Social Survey’s sample check on the accuracy of 
the Register were reassuring. About 96% of those entitled to be on it 
were in fact listed, 92% still being at their registered address. The sam- 
ple check took place 7 months after the qualifying date and the authors 
estimated that, at the time of replacement of the old Register, 87% 
of those entitled to be on it would still be at their registered address. 
The effect of the initial loss of 4% on the composition of samples was 
not, in the authors’ view, likely to be serious. More important were the 
removals, which accounted for the remaining inaccuracy. Movers tend 
to contain a high proportion of young people so that there is a danger 
of bias (cf., the findings in a recent readership survey [28]). Gray, et al., 
discussed this danger and proposed a method for dealing with the 
problem in practice. 

Broadly speaking, it now seemed that the Register of Electors could 
with suitable care be used as a frame for samples of individuals and 
households. The importance of these findings was enhanced by the dis- 
continuation of the National Register in 1952. Since then the Register 
of Electors has become the most commonly used frame for random 
samples, both by the Social Survey and others. Edwards [19] has 
described the practical problems involved in its use. The Rating Rec- 
ords are the only other frame commonly used for large-scale surveys; 
several inquiries based on it will be mentioned below. 

Types of Sample Design. The author’s 1949 article referred to the 
occasional practice—in otherwise random sample designs—of selecting 
first-stage units, say towns, purposively. It also discussed the con- 
tinuingly wide use of quota sampling and commented that this method 
“should . . . not be employed in any surveys on which administrative 
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decisions are based.” Now, in 1955, the use of purposive selection in 
the above manner and that of quota sampling have virtually dis- 
appeared from such surveys. 

In market and opinion research, the economic attractions of quota 
sampling are still regarded by many as sufficient to commend its use, 
but even here a tendency increasingly to use random methods is 
discernible. 

Refinements in Random Sample Designs.? To begin with, we should 
note the influence upon random sampling practice in this country of 
the work of the Government Social Survey. Most of the designs now 
used in official and other surveys bear some resemblance to those 
initiated by the Survey and described in the paper by Gray and 
Corlett [24]. 

(a) Stratification Factors. In most national surveys of the general 
population administrative districts (county boroughs, urban districts 
and rural districts, etc.) or parliamentary constituencies are used as 
first-stage units; when there are more than two sampling stages, wards 
or polling districts are the most common second-stage units; addresses, 
households or individuals being the final-stage units. The first-stage 
units are usually stratified geographically and by urban/rural dis- 
trict. Administrative districts, which vary enormously in size, used 
also to be stratified by population size. At the time of the previous 
article, these stratifications were the only ones in general use; inter- 
mediate-stage units, such as wards or polling districts, were generally 
not stratified at all, because of the lack of suitable data. The main 
weakness in all this was that neither the typical first-stage nor sec- 
ond-stage units were generally stratified by anything related to eco- 
nomic or social characteristics. Since then, two indexes relevant for 
this purpose have been developed. Both are due to the Social Survey. 

The industrialization index. This index, which is described by Gray 
and Corlett [24], is the “ratio of industrial to total rateable value in 
each district” and appears to be a useful index of the economic charac- 
ter of a district. With its development, the Social Survey turned to a 
new type of sampling design: administrative districts are no longer 
stratified by size of population, but by industrialization index (as well 
as region and urban/rural district), the selection within strata then 
being made with probability proportionate to size. This type of design 
has been widely followed by other orggnizations. 

The J~index. The more serious weakness of earlier designs was the 





* Two qualifications must be made about this section. In the first place, it will apply only to samples 
of the general population; there is no space to discuss refinements introduced in the many surveys of 
special groups. Second, theoretical developments are not discussed. 
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lack of data by which either wards or polling districts could be stratj- 
fied. Indeed, it was because of this that many random sample designs, 
including those of the Social Survey, were designed on a two-stage 
basis, with no stage intermediate between administrative districts (or 
constituencies) and the final-stage units. This, of course, was costly 
in terms of field-work. The Social Survey’s search for possible stratifi- 
cations resulted in their development of the so-called “J-index’, 
described by Gray, Corlett and Jones [26]. On the Register of Electors, 
electors liable for jury service are marked with the letter “J”. As 
qualification for jury service is a function of the rateable value of the 
premises occupied by the elector, it seemed reasonable that the “per- 
centage of registered electors with this suffix in any area might serve 
as an indicator of the economic status of the area”. Tests of the index 
confirmed this. The index has the further advantage that it can readily 
be calculated for any area, whether ward, polling district or, for that 
matter, administrative district, constituency or any arbitrary area. It 
is used regularly by the Social Survey and occasionally by other 
organizations. The set of J-indexes for England and Wales is expected 
to be published in the near future. Corlett has given some additional 
details on the index in [11] and [12]. 

Another factor for stratification which is occasionally used is the 
proportion of voters voting, say, Conservative, at the last election. 
If constituencies are the first-stage units, this is a good stratification 
to use. (Constituencies have several points to commend them. For one 
thing, they do not vary as much in size as administrative districts; 
for another, the Register of Electors is arranged by constituencies). 
Other possible stratification factors might be derived from the 1951 
Census of Population: for instance, the proportion of occupied persons 
in an area who finished their full-time education at a given age; the 
proportion of adults in an area in a given social class (the criterion 
being occupation) ; housing density and so forth. 

Possibilities of stratifying households, addresses or individuals are 
limited. It would be practicable to stratify a Register of Electors’ 
sample by sex, as the sex of most individuals can be deduced from their 
Christian names. More useful would be stratifications by age, income 
or occupation but none of these are feasible with our national sampling 
frames (except, of course, after selection). 

(b) Other Points. Selection with varying probabilities has been a 
regular feature of population sampling in the U.S. for many years 
and, as already mentioned, has become fairly general practice here. 
Edwards [19] describes a national four-stage random sample in which, 
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at the first stage, administrative districts were selected from within 
strata with probability proportionate to population; then polling 
districts were selected within the chosen administrative districts 
again with probability proportionate to population; the third-stage 
units were households and the fourth-stage units individuals. Over-all 
probabilities were kept equal. The problem of whether to sample with 
or without replacement has been tackled in various ways. Sometimes 
the difficulty has been by-passed by selecting systematically within 
strata (see the example in [24]); at others the selection has been made 
without replacement, but the sampling errors calculated (much more 
simply) as if it had been with replacement. Very occasionally, the 
selection has been made with replacement, the sampling units being 
included more than once if necessary. This is inconvenient, though 
not as serious a drawback as is sometimes suggested; in the official 
Household Expenditure Enquiry described below, one rural district 
was included twice. In this connection we may refer to a result in a 
paper by Durbin [15] to the effect that, if sampling is done without 
replacement but the standard error calculated as if it had been with 
replacement, the true standard error will be overestimated, but by only 
a small amount. 

As a result of the new stratification indexes described above, na- 
tional samples are now usually three-stage designs, where formerly 
two-stage designs were more common. Although the resultant saving 
in cost is obvious, surprisingly little is known, or at any rate published, 
about the effects of different degrees of clustering on variances and 
costs, about the benefits of the various stratifications or, for that 
matter, about the precision attained in general. As the users of survey 
results come to attach greater importance to the size of sampling 
errors, so samples will be designed more with an eye to easing the 
sampling error calculations. If this point is not borne in mind during 
the design stages, these computations can become prohibitively com- 
plicated. Practitioners are then tempted to make rough “back-of-the- 
envelope” estimates of sampling errors by using the formulas applying 
to simple random sampling. Since the samples are generally stratified 
multi-stage designs, these formulas are inappropriate and misleading. 
They may underestimate the true sampling error by a considerable 
margin (see Lydall [33]). 

It often happens that the population to be covered differs from that 
of the most suitable frame. Such was the case in the survey described 
by Edwards [19] to which reference has already been made. Here a 
sample of individuals aged 16 and over had to be drawn from the 
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Register of Electors, which covers adults aged 21 and over. The solu- 
tion adopted was to use the Register for drawing a sample of house- 
holds and then to select individuals from a list of household members 
drawn up by the interviewer. The procedure used was that suggested 
by Kish [30] and was found to work well. 

Research on Methodology. One of the important changes since 1949 
is the growing attention given to research into survey methodology. 
To some extent every survey organization conducts research into its 
own methods but it is a fact that most of them are too busy with 
commissioned surveys to devote much time and staff to research. 
Some valuable findings of course come about as by-products of every- 
day surveys but, unless the surveys have been designed with this in 
view, the results usually lack the decisiveness of experimental conclu- 
sions. 

Outside the survey organizations themselves, one of the develop- 
ments to note is the establishment, at the London School of Econom- 
ics, of the Division of Research Techniques, which devotes a part of 
its resources to research on survey methods. The Division has been 
financed since its inception in 1949 by the Nuffield Foundation and is 
directed by M. G. Kendall. It is charged with the task of studying 
“those branches of methodology required in the research undertaken 
by the School”, so that its attention to survey methods is logical. 

Four of its researches should be mentioned although, since all have 
been published, there is no need to go into detail. The first experiment 
set out to test differences in performance and results of experienced 
professional and inexperienced student interviewers. Though the 
former were of course expected to be more successful in obtaining 
interviews, one did not know how big the differences would be or 
what effects it would have on results. The evidence on these two 
questions is presented in papers by Durbin and Stuart [18] and Booker 
and David [4]. 

The Division’s second field research program was into the nature 
and efficiency of quota sampling. A report on the practice of quota 
sampling in this country—published by Moser [43]—served as the 
preliminary stage. The central part of the research was an experiment 
designed to study the relative accuracy and precision of random 
sampling and of different types of quota sampling. The findings have 
been published by Moser and Stuart [45]. The discussion following the 
paper, and published with it, contains the views on the quota sampling 





DEVELOPMENTS IN SAMPLING IN GREAT BRITAIN 1201 


controversy of a number of experts in this country. The Division’s 
third experiment, which dealt with variability in the coding process, 
has been described by Durbin and Stuart [16]. 

The Division’s most recent field experiment was designed to study 
the question of call-backs in random sample surveys and, at the same 
time, the effects of different degrees of clustering on variances and 
costs. The results were published by Durbin and Stuart [17]; they 
included some reassuring findings on interviewer variability. A 
theoretical paper on the non-response problem has been published by 
Durbin [14]. The Division, like many other bodies interested in sample 
surveys, is now turning its attention increasingly to the study of re- 
sponse errors; it is currently engaged in research on interviewer vari- 
ability (see also [42]) and is planning a project on memory errors. 

Two general features of the Division’s survey researches may be of 
interest. In the first place, they all are based on experimental designs 
in the modern statistical sense. Secondly, they are planned by a 
Committee on which members of the Division are joined by repre- 
sentatives of the Social Survey, the BBC Audience Research Depart- 
ment and several of the leading market and opinion research organiza- 
tions. The fieldwork is carried out by the professional organizations and 
the planning and execution of the projects is a joint effort between the 
academic members and survey practitioners. Experience during the 
last few years suggests that this cooperation is fruitful for both sides. 

The Division is not the only academic institution in this country 
with an interest in survey methods, although there seem to be hardly 
any others which devote a substantial part of their resources to re- 
search into survey methodology per se. The Department of Applied 
Economics of Cambridge University has, in the course of its research 
on social accounting, done a good deal of work on the methodology 
of family expenditure studies and on the use of sampling in examining 
national or local accounts. Among publications reporting on the former 
are papers by Utting and Cole [55] and [56]; among those dealing with 
the latter, and therefore with samples of documents and institutions, 
are those of Utting [54] and Stone, et. al., [51]. The Oxford Institute 
of Statistics has produced many points of methodological value in the 
course of its Savings Survey (see, in particular, Lydall [33]). 


NEW APPLICATIONS 
We shall not attempt anything like a comprehensive coverage of new 
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applications since 1949. The few cases selected are intended to illus- 
trate the gradual widening of the field to which sampling methods 
are being applied. 


Official Sample Surveys 


1. The Government Social Survey 


The Social Survey’s role in government administration has not 
altered since 1949. Gray and Corlett [24] gave an illustrative list of 
surveys carried out between 1946-49; a later description of the Survey’s 
work is given by Moss [46]. Of methodological interest have been 
several surveys concerned with the prediction of behavior, such as the 
enquiry into the demand for campaign medals [59]. At the end of the 
war some 7 million people were entitled to wear medals and, if all of 
them were to claim their medals, this would have meant striking about 
20 million medals. The Survey was asked to predict the likely demand 
and used an attitude-scale type of approach to do so. Their prediction 
that only about a third of those entitled to medals would apply for 
them proved very close to the actual demand; as a result some £200,000 
was saved. 

An earlier survey predicting the effects of freeing the distribution of 
domestic coal was useful in its demonstration that, at that stage, such 
a measure would have had disastrous effects on the export surplus. 
In a very different field, a recent study of the use of telephone direc- 
tories [31] enabled the Government to effect various economies and 
again save a substantial sum of money. 

A study of considerable methodological interest, especially in the 
analysis of data, is the attempt to predict the success of training at 
Borstal institutions and in Approved Schools. The results and methods 
are described in a book by Mannheim and Wilkins which will be 
published shortly by H.M.S.O. A short technical paper by Wilkins 
[58] has already appeared. 

A current inquiry into hospital nursing is employing sampling 
techniques for timing the observations made in a number of hospital 
wards. These observations will be used in conjunction with attitude 
studies amongst patients and nurses to gauge the effects of proposed 
changes in the organization of nursing duties. 

On the subject of health the Survey has done an interesting study of 
General Practice. Two articles by Gray and Cartwright have dealt 
respectively with “Choosing and Changing Doctors” and with the issue 
and use of medicines [22, 23]. 





DEVELOPMENTS IN SAMPLING IN GREAT BRITAIN 1203 


A considerable part of the Survey’s work is concerned with con- 
sumer expenditure. Over the last few years there have been inquiries 
into expenditure on laundry, dry cleaning and shoe repairs; household 
textiles and furnishing fabrics; house repairs and domestic service; 
meals in restaurants; pharmaceutical products; holidays; hairdressing 
and cosmetics; and other sectors of expenditure. These surveys are 
conducted for the Central Statistical Office to assist in its estimates of 
national expenditure. 

Another field in which the Survey is particularly active is nutrition. 
The National Food Survey is mentioned below; there have also been 
inquiries into the diets of young children and of other special groups. 

We may also mention at this point a large-scale anthropometric 
survey carried out for the Clothing Industry Development Council 
in 1951-1952 (see [8]), since a member of the Social Survey staff— 
W. F. F. Kemsley—was responsible for the analysis and report. The 
purpose of the survey was to assist manufacturers of ready-made 
clothing with the problem of sizing. In order to secure the necessary 
data, 37 measurements were taken on each of 5000 women between 
the ages of 18 and 70 and results on these measurements were then 
given for women of different ages and occupations, with different 
numbers of children and so forth. More important, the relationships 
between the various measurements have been established. 

These are only a few of the varied activities of this organization. 
Far more of its work—both in the way of survey reports and of 
methodological research—is now published than was the case in 
1949; indeed, this is obvious from the references already given. Alto- 
gether the Survey’s work stands at a high level of technical efficiency, 
and it is a tribute to its reputation and to the growing appreciation of 
the role of sample surveys that it has been able to continue through 
these years of economic pressure. It is to be hoped that this appreci- 
ation will remain and grow and that the Survey’s potentialities will 
be fully exploited by successive administrations. 

We should also note and welcome the increasing extent to which the 
Social Survey has been brought into the field on the borderlines of 
social and economic research. It has been linked with a part of the 
social mobility enquiry at the London School of Economics, with 
the Savings Survey at Oxford, with the work of the Department of 
Applied Economics at Cambridge (in this case to give technical advice 
on matters of design and analysis) and with a current survey among 
university graduates by Political and Economic Planning. These are 
only a few of many instances. 
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2. The General Register Office 


One of the features remarked ‘on in 1949 was the failure of the 
General Register Office to avail itself of sampling methods in its 
demographic work and, in particular, in the Population Census. The 
hope was then expressed that the Registrars-General would decide to 
use sampling in the 1951 Census. In the event, sampling was not used 
in the collection of data; but, for the first time, a preliminary analysis 
was based on a sample of the Census schedules, allowing some results 
to be published within 18 months of the Census data [20]. Details of 
the sampling procedure are given in the report, together with a dis- 
cussion of the representativeness of the sample and some adjustments 
made to the results. An interesting discussion of the accuracy of the 
sample appears in Yates [60], section 10.16. 

The speed with which the preliminary Census results became avail- 
able amply justified the Registrars’-General decision to analyze a 
sample of the schedules in the first instance. It is to be hoped that 
they will decide in favor of using sampling in the collection of, at any 
gate, some of the data at the next Population Census. Some questions, 
of course, must be asked of everyone, but on others sufficient precision 
could probably be attained with a sample coverage. The use of multi- 
phase sampling in the U. 8. Population Census could well serve as an 
example. There is also a strong case for preceding each Census with a 
series of sample surveys designed to test various forms of questions, 
definitions, methods of enumeration and so forth; and for following 
each Census by checks on the quality of data obtained, rather on the 
lines of the Post-Enumeration Surveys conducted by the U. 8. Bureau 
of the Census. 


3. The Ministry of Food 


The National Food Survey, described in the author’s previous article, 
still continues, but with its coverage now extended from the working- 
class to the whole population. Four reports on the methods and results 
of the Survey have been published [35, 36, 37, 38] and others are 
promised. As from 1953, the fieldwork, coding and analysis have been 
taken over by the Social Survey. 


4. The Ministry of Labour Household Expenditure Enquiry 


The present U.K. Index of Retail Prices is based on weights broadly 
in accordance with the results of the 1937/8 Ministry of Labour 
Family Expenditure Survey [41]; actually they were revised somewhat 
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as from January 1952 to represent the estimated 1950 pattern of con- 
sumption. Popular feeling, as well as technical opinion, had for some 
time favored bringing the weights properly up tc date as soon as con- 
ditions permitted and a report of the Cost of Living Advisory Com- 
mittee in 1952 [40] proposed that preparations should be made for a 
new Household Expenditure Enquiry. The proposal was accepted by 
the Government and the inquiry began in January 1953, continuing 
for one year. In determining the sampling plan, interviewing and other 
methods, experimental inquiries conducted by the Social Survey over 
many months were of very great value. The inquiry itself was directed 
by the Ministry of Labour, in consultation with the Central Statistical 
Office. In Northern Ireland the field work was carried out by the 
Northern Ireland Ministry of Labour, in Great Britain it was shared 
between the Ministry of Labour and the Social Survey. Few details 
about the methods of the inquiry have as yet been published and only 
some general points can be made here. 

The inquiry covered the entire (non-institutional) population of the 
country, although some classes of households may be excluded when 
computing the weights for the new index. The Government, however, 
accepted the advice of the Cost of Living Adviscry Committee that 
the additional information obtained from covering the entire popula- 
tion would be of great value and would add proportionately little to 
the over-all cost. 

Each sampled household was asked to provide expenditure data for 
a period of 3 consecutive weeks and the sample (a two-stage design) 
was so arranged that, for each geographical region as well as for the 
country as a whole, it was spread evenly over the year. This solution 
of the seasonal problem seems preferable to the more usual one of 
trying to get households to cooperate for, say, four weeks at quar- 
terly intervals. There were many features of methodological interest 
in this inquiry, for instance in the steps taken to secure cooperation, 
in the method of collecting the data (a mixture of interviewing and 
record-keeping) and in the schedules themselves; however, discussion 
of these must await publication from an official source. 


5. Ministry of Labour Age-Analysis of Employed Persons 


The Ministry of Labour makes annual estimates of the total number 
of employees in the country from the quarterly exchange of employ- 
ment insurance cards (see note in [44]). As the insurance cards do 
not state date of birth, these analyses do not give separate figures for 
age groups—except for a division into under 18 and 18 and over (these 
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groups have different contribution rates). The dates of birth are, how- 
ever, recorded on the ledger accounts kept by the Ministry of National 
Insurance and these serve as the sampling frame for the special analy- 
sis which produces detailed figures for each industry by age and ser, 
regional age distribution and data on the relation of age and unemploy- 
ment; the analysis is described in [39]. 


6. Ministry of Education 


An interesting case of a sample of a specialized population was pro- 
vided in a survey designed “to obtain an estimate of the mean score 
on a certain reading test of pupils aged 15 in grant-aided schools in 
England and Wales”. The sample was a two-stage one, with Local 
Education Authorities’ areas as first-stage units and schools as second- 
stage units. All the pupils of the required age in the selected schools 
were tested. The areas were stratified into urban and rural, and areas 
were then selected with probability proportionate to population. The 
schools were stratified by five types and selection was again made with 
probability proportionate to size. The sample design and estimation of 
sampling errors is discussed by Peaker [49]. 


7. Ministry of Agriculture 


The author’s previous article described the sampling scheme of the 
National Farm Survey. A similar type of design has been employed 
in the inquiries into earnings and conditions of employment in agricul- 
ture conducted by the Ministry of Agriculture. This was described by 
Palca and Davies [48]. 

The purpose is stated to be as follows: “In the agricultural industry, 
there is a statutory determination of minimum rates of wages related 
to a standard working week, and overtime rates and certain other 
elements of farm workers’ remuneration are also statutorily deter- 
mined . . . there was so far no information on average weekly earnings” 
and it is this gap the surveys seek to fill. They cover hours of work and 
numbers employed by occupation as well as earnings. 


8. Commissioners of Inland Revenue 


A weak link in British economic statistics is the relatively scanty 
information about the distribution of personal incomes. Although 
ingenious estimates have been made from time to time, no straight- 
forward official statistics have been available. 

The source from which estimates normally begin are the annual 
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reports of the Commissioners of Inland Revenue, the authority re- 
sponsible for the administration and collection of income tax. Gener- 
ally these reports—which are based on personal income tax returns— 
give distributions of incomes from specific sources (e.g., salaries, 
profits, rent) not distributions of income aggregated for each person. 
However, for incomes assessed in the years 1949-50, the Commissioners 
took a 10% sample of income tax schedules, aggregated the incomes of 
various kinds on each schedule and produced income distributions. 
The main results are given in two reports [9, 10]. Of particular interest 
are the breakdowns, in the former report, of income by household 
composition and those in the second, by geographical region. 

Though useful, these figures suffer from the drawback that, since a 
married man’s income tax schedule is likely to include his wife’s earn- 
ings, the derived distributions refer neither precisely to the incomes 
of individuals nor to those of families. In addition, there must always 
be some doubt about the accuracy of any figures derived from personal 
income tax returns. 


9. Board of Trade Censuses of Production and Distribution 


An important development was the decision to use sampling in the 
1953 Census of Production (i.e., the one covering production in the 
year 1952). In order to reduce the burden upon industry, it was decided 


to ask fewer questions generally and to require returns from only a 
sample of firms. All firms over a given size in each trade were asked to 
make returns, but only a sample of the smaller firms. Thus, most firms 
were in effect excused from the necessity of making returns. 

This made it possible to publish early estimates of the most impor- 
tant aggregates like total sales, materials used, stocks and capital ex- 
penditure. According to the statement announcing the decision to use 
sampling [1], a sample coverage was feasible only because detailed 
figures of output and material had been obtained for 1951 and were 
not required again for 1952. Such details could not have been satis- 
factorily obtained by sampling. The 1954 Census (for production in 
1953) is being conducted on a similar sample basis. 

The Government has now announced [8] that a full Census of 
Production is to be taken for the years 1957 and 1961 but that, even 
on these occasions, detailed returns will be required only from firms 
employing 25 persons and more. A Sample Survey of Production is to 
be taken in each year when no full Census is conducted. 

The same statement announces that a Census of Distribution is to 
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be taken once every ten years and that, during the interval, Sample 
Surveys of Distribution should be made every third or fourth year, 
This summarizes very briefly the official policy regarding the future 
of the Censuses of Production and Distribution. It represents a major 
step forward in the role of sampling for the collection of official sta- 
tistical data. 


10. Board of Trade Capital Expenditure Surveys 


In 1950 and 1951 the Board of Trade conducted inquiries “to ascer- 
tain the level of capital expenditure by manufacturing industry in 
Great Britain” [2]. There had been a Census of Production in respect 
of 1948, which covered capital expenditure, but these special inquiries 
made results available more rapidly and enabled capital expenditure to 
be related to calendar years. 


Public Opinion and Market Research 


There are no fundamental changes to report in this field, although 
one can discern a distinct raising of methodological standards. This is 
due both to increasing competition and to the spread of technical 
knowledge; that practitioners are taking an increasing interest in 
methodological advances is beyond question. 

In the field of public opinion research, there are no new applications 
of note known to the writer, but one may point to the success of the 
British Institute of Public Opinion in forecasting the general elections 
of the last few years, as well as local by-elections. Kendall and Stuart 
[29] describe a method for predicting the distribution of parliamentary 
seats from that of votes in the country. A paper by Durant [13] dis- 
cusses some of the problems of the Gallup Poll. 

The BBC’s activities in studying radio listening habits and tastes 
were described in the author’s previous article. The only development 
to note is the natural progression from “Listener Research” to “Audi- 
ence Research.” Attempts to cover the television audience meet with 
the inevitable difficulty of small numbers; the problems of research in 
this field are discussed by Silvey [50]. 

A field of activity which has become increasingly important in recent 
years is that of newspaper readership surveys. The interest of their 
results to newspaper proprietors and advertisers does not need stressing 
and there are now every year two or three sets of results available from 
different sources. We may refer to a booklet published by the Market 
Research Society [34] which gives a history of readership surveys, ex- 
plains their problems and makes suggestions for future work. In 1954 
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readership surveys were published by the Hulton Press [27] and by the 
Institute of Practitioners in Advertising [28]. 

Two other market research activities, slightly outside the normal 
run of consumer surveys, may be mentioned. One is the Consumer 
Panel, noted in the previous article, run by Attwood Statistics Ltd. 
Wadsworth [57! has discussed the Panel from the point of view of the 
user. The other activity which seems to gain importance is the retail 
panel or audit in which information derives from retailers rather than 
from consumers. The work of A. C. Nielsen Ltd. in this field is well- 
known and there is another retail audit panel in this country, run by 
the British Market Research Bureau, the working of which has been 
described by Treasure [53]. 

One market research casualty of recent years was the British Export 
Trade Research Organization, a body devoted to overseas market re- 
search. A discussion of the reasons for its failure was published by 
Treasure [52]. 

We have remarked on the increasing interest of market research prac- 
titioners in methodological advances. Indicative of this growing willing- 
ness to come out into the open, discuss their methods with each other 
and with academic experts, and to participate in methodological re- 
search, is the foundation of the Market Research Society and the good 
attendance at its meetings and weekend schools.’ Even more indicative 
is the increasing willingness to publish. Whereas the author’s previous 
article was unable to instance a single major paper published on market 
research in this country, quite a few can now be mentioned. To the 
ones already noted may be added the paper by Cauter [7] describing in 
detail the organization and work of a large-scale market research agency. 


Social and Economic Research 


There remain the numerous sample surveys conducted by universi- 
ties, research institutes and individuals. Year by year this type of re- 
search activity grows, partly, of course, as a consequence of the de- 
velopment of sampling; many current projects are only practicable 
because of sampling. There is an attendant danger, in so far as research 
workers are sometimes led to attempt inquiries on too extensive a scale 
and to spread their resources too thinly. However, this can be prevented 
if statistical advice is taken, and, by and large, the influence of sampling 
is much to the good. 





* The Association of Incorporated Statisticians held a weekend school on sample surveys late in 
1953. Several of the talks given on this occasion are to be published in a collection of papers on British 
Market Research edited by J. Downham, E. Shankleman and J. Treasure. 
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In the author’s previous article a number of sample surveys in 
this field were described and many of these continue, some in extended 
form. The Population Investigation Committee’s inquiry into ma- 
ternity (see [44]) has grown into a “National Survey of the Health and 
Development of Children.” A large number of the children covered in 
the original (1946) sample are still included in the investigation and 
records of their physical development, their progress in school, their 
reading ability and intelligence are being kept. As a long-term study 
this is unique in social research. The National Foundation for Educa- 
tional Research in England and Wales is joining the P.I.C. in under- 
taking a national survey of standards of attainment in reading and 
arithmetic at various ages. 

In the educational field we can also mention the inquiry—under- 
taken by Political and Economic Planning and the Social Survey—into 
the jobs currently held by 1950 university graduates and particularly 
into the use of arts and science graduates in industry. A survey into the 
social and educational background of applicants for places in universi- 
ties in 1955 and 1956 is in its early stages. This is sponsored by the 
Committee of Vice-Chancellors and Principals. A research team at the 
London University Institute of Education is conducting a national 
sample survey of the school teaching profession. 

Findings of the research on social mobility, which has been continu- 
ing at the London School of Economics for some years (see [44]), has 
recently been published in a volume edited by Glass [21]. Much of the 
material in this book is based upon a survey into the social origin, 
education and occupational achievement of a national sample of some 
10,000 adults. Ancillary investigations included work on the social 
prestige of occupations, the relation of education and social mobility, 
child upbringing and social class, the relation of social mobility and 
marriage, recruitment in a number of professions and the structure of 
voluntary organizations. 

Surveys on the travel habits of the people of London and Bristol 
have been undertaken by the London Transport Executive [32] and 
the British Transport Commission [6]. The Social Survey is currently 
making a major study of travel in London for the Road Research 
Laboratory and London Transport Executive. This attempts to get 4 
measure of the effects of the cost of transport delays as well as to pro- 
vide information on the demand and use of various transport services. 

There continues to be much research into the expenditure and saving 
patterns of different population groups. We have already mentioned 
the official 1953 inquiry and the work of the Oxford Institute of Sta- 
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tistics and the Cambridge Department of Applied Economics in this 
field. 

The National Health Service and the standard of General Practice 
have been the subject of a certain amount of research. The British 
Medical Association sponsored an inquiry into General Practice [5], 
one part of which was a field study of a sample of 188 General Practices, 
the other a postal inquiry directed to the remaining National Health 
Services general practitioner principals. 

These are only a handful of many current or recent social and eco- 
nomic research inquiries. The reader wishing to get an idea of the im- 
mense range of activity in this field should consult the register published 
annually by the National Institute of Social and Economic Research 
[47]. 

CONCLUDING REMARKS 


The four or five years since the time of the previous article have seen 
considerable advances in the sampling of human populations in this 
country. The field of application has widened, scepticism of the reli- 
ability of sampling seems to have become less widespread, and the 
standard of methodology has been raised. This applies almost equally 
to official, commercial and academic sample surveys. 

The causes of these changes are manifold. Among them certainly is 
the spread of knowledge of how sampling works, which itself comes 
from the increasing attention given to the subject in universities and 
from the growing literature. There are now available several good books 
on the theory and practice of sample surveys and university courses on 
the subject are attended by large numbers of students. Among the 
causes, too, is the interest newly taken by the universities in methodo- 
logical research in this field. Most important of all is the growing will- 
ingness of those who conduct—and those who commission—surveys to 
publish their methods and results. This must inevitably serve to stimu- 
late interest and to raise the standard of efficiency. 
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USES OF CRUDE VITAL RATES IN THE 
ANALYSIS OF REPRODUCTIVITY* 


GrEorGE J. STOLNITz 
Princeton University 


I. INTRODUCTION 


ross and net reproduction rates continue to be widely used in 

population studies, as they have been for the last quarter of a cen- 
tury. Although subjected to increasing criticism in recent years, such 
rates are still likely to be cited as uniquely valuable indicators of demo- 
graphic behavior and prospects, while their unavailability for many 
areas is almost universally taken to be a major gap in knowedge. The 
purpose of this paper is to consider whether the kinds of information 
commonly sought from reproduction rates may be inferred from crude 
vital measures, which are much more generally obtainable. In particu- 
lar, to what extent can it be expected that the two types of meas- 
ures will show parallel variations over time or space? 

In view of the importance usually attached to reproductivity pat- 
terns, the question is of obvious interest as a problem in estimation. 
But in addition it throws new light on some methodological issues which 
have long been regarded as resolved. Reproduction rates are supposed 
to provide a needed escape from crude rates and the possibility that the 
two might be substitutable in significant ways seems to challenge the 
very purpose of a large part of traditional demographic measurement. 
It is useful, therefore, to begin by reviewing the main arguments ad- 
vanced in support of reproduction rates. 

Almost invariably, such arguments have started with the premise 
that crude vital measures present a distorted view of demographic be- 
havior. A population’s crude birth or death rate for any period depends 
not only on the rate of childbearing or dying in each age class, but also 
on the relative numbers of persons in such classes.' Since these numbers 





* The author is indebted to Clyde V. Kiser, George F. Mair, Leighton van Nort and Norman B. 
Ryder for criticisms and to Erna A. Harm, Blanche E. Cogan, Elizabeth Souther and Hugh Dingle for 
computational assistance. 

1 The birth rate for any calendar period is defined as the ratio B/P, where B is the average annual 
number of births in the period and P the average size of the population (generally expressed in thou- 
sands). Let b; denote the births among women of age t, pj the number of women at that age, rj the age- 
specific fertility rate b;/p;, and f; the fraction p;/P. It follows that the birth rate can be written as 
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i.¢., a8 a weighted sum of the age-specific rates. (The births could as readily be classified by age of father, 
but the criteria for preferring one classification over another are often complicated and need not concern 
us here.) The death rate can be represented in the same way, the weights at each age being the ratio of 
Persons of both sexes to the total population. 
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are the outcome of earlier movements in fertility, mortality and 
migration, any crude rate can describe the present only at the expense 
of also involving the past. 

The avoidance of this difficulty has been cited as the great contriby- 
tion of gross reproduction rates in the study of fertility. Any two popu- 
lations with the same age-specific rates of childbearing will have 
identical gross rates, whatever the differences in their current age dis- 
tributions, The corresponding property holds for the net reproduction 
rate, this time in connection with the analysis of replacement. The 
percentage rate at which a population will eventually change if existing 
vital conditions continue into the future can very generally be deter- 
mined without any reference at all to its current age composition. The 
net reproduction rate provides a measure of this change, over time 
spans of a generation (approximately 30 years).? 

It is typical of the large literature on these questions that it deals 
almost exclusively with the problem of analyzing the experience of a 
given time and place. Little or no theoretical attention has been given 
to considering the contrasts between crude and reproduction rates for 
purposes of studying trends or spatial variations. This oversight has 
been a natural one in some respects. In the first place, reproduction 
rates give the appearance of being inherently dynamic, in that even a 
single value describes how the population may change over time. 
Thus if a given experience indicates a net rate of 1.2, it also implies an 
eventual population growth of 20 per cent per generation, and this 





2 Exceptions to the last two statements are conceivable but are of mathematical interest only. 
Actual populations subject to unchanging age-specific fertility and mortality rates will after a while 
take on a “stable age” form, whose age composition and rate of growth are constant over time. These 
are independent of the population's original age composition. 

Not the least of the attractions of reproduction rates has been their apparent common-sense appeal. 
Basically, the gross rate can be interpreted as measuring the number of children ever born to women who 
have already survived to the end of the reproductive period of life. The net rate, in addition to its func- 
tion as a stable-age parameter, is intended to measure the number of children among women who are 
traced from the time of birth and may die before completing the childbearing years. Popular interest in 
reproductivity is likely to focus on questions requiring just these kinds of information. No doubt the 
natural tendency is to think of actual groups of women. By ready analogy, however, it is a small step to 
consider hypothetical groups, whose reproductive “history” summarizes the experience of women at 
different ages during the same calendar period. As before, let {rj} denote the set of age-specific fertility 
rates (per 1,000 women) observed in a given period. Then 1,000 women, assumed to survive from birth 
through the reproductive years and to have the observed fertility rates, would have rg births at each 
age i. The total number of births during their lifetime would be =r;, the summation running from about 
age 15 to 50. The gross reproduction rate is defined as SXr; (SZr; +1,000 if expressed on a per woman 
basis), where S is the ratio of daughters to total births. (The reason for introducing this ratio is that it is 
often convenient to deal with parents and children of the same sex.) The net reproduction rate is defined 
as the number of daughters that would be born if the 1,000 women not only had the observed fertility 
rates but were also subject to the mortality riske of the period. Letting L; denote the fraction who would 
survive from birth to age i under the given mortality conditions, the number of births at that age would 
be r;Z; and the total children ever born 27;L;. The net rate for the period would therefore be S=riLi. 

All of the reproduction measures used here stem from these definitions. No attempt has been made 
to consider the analogous measures occasionally published for actual population groups (“real cohort” 
rates), for males, or for groups classified by variables other than age (on the last see [12]). 
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figure cun be compared with actual changes in the past. Secondly and 
more important, reproduction rates first came to be widely applied in 
the 1930’s and to the special conditions of Western nations. In all of 
these areas long-run fertility trends had been very predominantly 
downward before World War II. On the then plausible assumption 
that these trends would continue and would more than offset further 
declines in mortality, the net reproduction rate for any current period 
could be regarded as an upper estimate of the rates of growth that 
would actually take place in the future. In particular, the fact that 
most Western net rates of the time were close to or below unity seemed 
clearly to presage eventual population decline throughout the region. 
To many this seemed much more important than the observation that 
the net rates themselves varied from period to period. The continued 
excess of birth rates over death rates in most instances only added 
emphasis to this line of thinking. A single net rate often appeared to 
tell more about the future than a whole time series of crude vital 
measures. 

As a practical matter, however, reproduction rates have been used 
mainly in comparative contexts, the usual interest being less in individ- 
ual values than in the way these measures have varied over time or 
space. In this sense, the theory just summarized has been largely 
irrelevant to application. Granting that net or gross rates have unusual 
advantages for describing a single experience, it is by no means clear 
that similar advantages hold for comparative purposes. This is not, 
of course, to deny the usefulness of documenting variations as accu- 
rately as possible. Indeed, the importance of such variations is taken 
for granted for present purposes;? Otherwise there would be little 
point to seeing if adequate approximations may be obtained by indi- 
rect means. The primary purpose of this paper is to examine the prob- 
able extent of similarity between movements in gross and net rates on 
the one hand, and crude vital rates on the other. Nevertheless, many 
of the following findings are also implicit criticisms of the traditional 
theory, not so much for what it says as for what it overlooks. 


II. TEMPORAL VARIATIONS 


Very nearly all of the reproduction rates discussed below were taken 
from publications by Depoid, Kuczynski and the United Nations 
(1, 6, 10]. The time series derivable from these sources cover 27 coun- 
tries and come close to exhausting the existing materials for national 





} Whether this is justified is another matter. The fact is that reproduction rates are still primary 
tools in practice. The critical literature, although extensive, seems to have ignored the question raised 
here, i.e., whether the practical uses of reproduction rates have really marked a significant departure 
from older and simpler measures. (See for example [2, 5, 9, 13].) 
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populations through the late 1940’s, the terminal years for the data 
of this study.* 

Crude birth and death rates were obtained for each of the periods 
for which a reproduction rate was available for an area and the matched 
values used to compare changes between periods. For example, the 
reported gross reproduction rate for France was 1.31 in 1904-07 and 
1.23 in 1908-13, and the corresponding crude birth rates 20.4 and 19.3; in 
this case a decline of 6.1 per cent in the one measure could be compared 
with a decline of 5.4 per cent in the other. In view of the dimensional 
dissimilarity between the two types of measures, all variations have 
been expressed in percentage terms. 

1. The gross reproduction rate. Since this rate is intended to describe 
fertility proper, its obvious analogue among crude vital measures is 
the birth rate. The matching process just described yielded a total of 
460 paired changes between the successive periods on record in the 
various areas. Less than 6 per cent of such changes showed the meas- 
ures moving in opposite directions; the value of 7.4 per cent in the first 
column of Table 1 was obtained by adding half the number of in- 
stances in which a non-zero movement in either measure was paired 
with a zero variation in the other. Small as they are, these fractions 
greatly exaggerate the significance of the apparent dissimilarities. As 
can be seen from Figure 1 (second and fourth quadrants), both meas- 
ures varied by less than one per cent in over a third of the cases in- 
volving divergent movements. Almost always, therefore, a variation in 
the birth rate would have been sufficient to infer correctly that the 
gross rate moved in the same direction or that it failed to change ap- 
preciably.® 

With respect to magnitudes, the similarity in the two sets of vari- 
ations is brought out by the marked concentration of points about the 
main diagonal of Figure 1 and again by the position measures shown in 
Table 1. More than half of the paired changes differ by less than one 
percentage point and nearly four fifths by less than two points. Also, 





4 Many of the rates are for single years, but a large proportion are for longer periods. Partly because 
of this and partly because the time series are often incomplete, the time intervals between successive 
measures tend to be highly variable within a country as well as between countries. The reliability of the 
rates and the ways in which they were computed also tend to vary widely. Some of the nineteenth- 
century data, taken from Depoid, were obtained by methods which exaggerate the apparent parallelism 
of movements in crude and reproduction rates. There is little reason to suppose, however, that these 
factors ‘aave had any substantial overall effects. If the data are deficient in any serious way, it is with 
respect to the underrepresentation of the populations of Latin America, Africa and Asia. 

’ Eleven of the countries in the sample had 20 or more changes and accounted for nearly three 
fourths of the paired variations. Only three of these areas had appreciable proportions of divergent 
movements, some 13-14 per cent, while seven had proportions below 6 per cent. 
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TABLE 1 
SUMMARY OF DISTRIBUTIONS OF ABSOLUTE DIFFERENCES 


BETWEEN PAIRED PERCENTAGE CHANGES IN REPRO- 


DUCTION RATES AND CRUDE VITAL RATES* 

















GRR and NRR and aoe oe 
Item BR BR combination of 
BR and DRt 
1. All changes between successive measures 
No. of paired changes 460 431 431 
Divergent movements (%) 7.4 12.5 24.6 
Absolute differences between 
paired changes: 

Mean 1.3 2.3 5.3 
First quartile 4 5 1.8 
Median 8 1.2 3.8 
Third quartile Le 2.6 6.7 

2. Pre-1900 changes between successive measures 
No. of paired changes 75 65 65 
Divergent movements (%) 12.7 23.8 15.4 
Absolute differences between 
paired changes: 

Mean 1.2 3.5 3.4 

First quartile 3 8 1.6 

Median 7 2.8 2.7 

Third quartile $.¢ 5.5 4.3 

3. All 30-year changes 
No. of paired changes 112 88 88 
Divergent movements (%) 8.9 20.5 21.0 
Absolute differences between 
paired changes: 

Mean 4.4 10.2 9.1 

First quartile BY 5.8 3.6 

Median 3.8 9.2 7.8 

Third quartile 6.6 13.1 14.4 





* BR =Crude birth rate; DR =Crude death rate; GRR =Gross reproduction rate; NRR =Net 


reproduction rate. 


t The first of the combinations defined in footnote 13. 
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the differences in the total sample of changes have very nearly the same u 
mean and quartile values as do the differences in the pre-1900 sub- p 
sample. It follows that the distributions of differences before and after lc 


1900 were practically indistinguishable in these respects. Only the 
proportions of divergent movements differed notably between the two 
periods, being almost twice as large before 1900 as subsequently. The 
greater relative frequency of such movements in the earlier data might 
have been expected to result in larger average differences as well, 
That this did not happen is explained by the tendency for both meas- 


— «> = + eS 


Fie. 1. Tabulation of matched percentage changes in the gross reproduction 
rate and crude birth rate.* 
15 


5 


Percentage change in gross reproduction rate (¥) 


-10 


“15 
-5 -10 5 0 . 10 rt) 


Percentage change in crude birth rate (X) 





* Using changes between all successive periods on record in 27 countries. Changes of over 15 per 
cent in either measure occurred in about 1 of 10 cases. Estimating equation (all changes, ungrouped 
data): Y =.12+1.04X. 
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ures to change very little when varying in opposite directions. In each 
period the differences encountered in such instances had the effect of 
lowering, not raising, the average. 

These patterns are especially interesting in that the pre-1900 vari- 
ations usually spanned intervals of 5 or 10 years, while those for later 
periods were mainly annual changes. Thus it appears that the birth 
rate is almost as useful an estimator for intermediate periods as for 
very short-run intervals. This seems to be true even if we take account 
of the likelihood that some of the pre-1900 differences between the two 
measures are downward-biased (see footnote 4). 

To test for the kinds of relationships to be expected for longer periods, 
use was made of all the 30-year changes that could be derived from the 
original time series. For example, if gross reproduction rates in an area 
were available for 1901-05 and 1931-35, and again for 1906-10 and 
1936-40, both changes were computed and paired with the concurrent 
movements in the birth rate. The results are summarized in the bottom 
tier of Table 1. 

The larger errors generally encountered are in accord with expecta- 
tions. Differences between the percentage movements of the two meas- 
ures are likely to be substantially greater over periods of a generation 
than over intervals of a year or even a decade.* Nevertheless, the pres- 
ent results seem to the writer further evidence that the movements are 
sufficiently parallel for most practical purposes. In the first place, 
changes between dates as far apart as three decades are uncommon in 
actual studies of fertility trends. The reasonable presumption is that 
the average errors for such frequently cited intervals as 1 to 2 decades 
would be appreciably lower. Secondly, the errors to be expected should 
be considered in relation to the length of the period in question. The 
comparatively short-run changes in the gross reproduction rate which 
were discussed earlier had a mean absolute value of a little over 6 
per cent, whereas the 30-year changes averaged over 21 per cent. In 
relative terms, therefore, an average 30-year error of about 4 percent- 
age points is not much different from the previous results. Errors of 
this size would be prohibitive for purposes of estimating short-period 
movements, for example within a decade, but could hardly distort 
perspectives seriously in dealing with trends spanning a generation.’ 





* The longer the interval of change, the greater are likely to be the differences between the age- 
composition factors affecting the change in the birth rate. In contrast, constant “population weights” of 
unity are used in deriving the gross reproduction rate (see footnote 2 and also Tables 3 and 4 below). 

7 Thus if the gross reproduction rate declined by 30 per cent in country A and by 33 per cent in B, 
it is quite possible that the trends of the birth rates would make the change appear to be more rapid 
in A. However, differences as small as 3 or even 5 percentage points over periods as long as 30 years are 
not likely to be singled out for special attention in comparative studies. In fact, such differences may 
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Thirdly, long-run movements of the two measures are very likely to 
have the same direction. No instance of a divergent 30-year movement 
was encountered for any period beginning after 1860 and the concen- 
tration of such movements in earlier periods is probably attributable 
in good part to inaccurate data. So far as the present materials show, 
a practically unerring rule of thumb would be that a 30-year movement 
in the birth rate which exceeds 5 per cent implies a similar direction of 
the 30-year trend in the gross rate. A change below 5 per cent in the 
birth rate can be taken to imply that, although the movement in the 
gross rate is of uncertain direction, its magnitude is unusually small.’ 
Obviously such arguments could be pushed too far, even if we could 
assume that the present data are more comprehensive than is in fact 
the case. Certainly it would be well to allow for the likelihood that use 
of the birth rate as an estimator leads to rising average errors, the 
longer the interval of change. Moreover, whether the errors indicated 
here are prohibitive or tolerable depends on the objectives of estimation 
and these are rarely the same in different studies. Nevertheless, it does 
seem decisive that the suggested magnitudes are no larger than the 
errors frequently implicit in direct measures of the trend in the gross 
rate. It would be difficult to cite instances in the literature in which such 
magnitudes have been the occasion for any significant conclusions. 
This interpretation, if justified, has several important implications: 
(1) There are many contemporary populations for which birth rates are 
available but whose trends in the gross reproduction rate cannot be 
ascertained directly. Reasonable approximations are the most to be 
expected in these instances and general orders of magnitude are often 
enough. It can be expected that the main features of the unknown 
trend between any desired periods can be inferred from the direction 
and magnitude of the corresponding percentage changes in the birth 
rate. The latter changes may also serve in a second capacity, as useful 
checks on the reliability of gross rates estimated by other indirect 
methods.*® (2) Use of birth-rate changes as estimators or checks should 
prove similarly helpful in numerous historical investigations. (3) In 
most statistically advanced areas, reproduction rates are published a 
few years after the event, whereas birth rates are generally known al- 
most immediately. In such situations the birth rate may serve not only 
to estimate the change in the gross rate but also its most recent value. 





well be grouped under the same class interval in compiling distributions of international movements. 
The more important aspect, that A and B both experienced unusually rapid declines, would almost 
certainly be brought out by their birth rates. 

8 In all but one divergent movement, the birth rate varied by less than 5.5 per cent. Conversely, 
all such movements involved changes of 5 per cent or less in the gross rate and all but two involved 
changes below 3.5 per cent. 

® See for example [4] and [8]. 
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These applications refer mainly to the estimation of single or occa 
sional changes. There is impressive evidence that the birth rate can 
also be used to reconstruct the salient aspects of entire time series of 
gross rates. Thus among the eleven countries with 20 or more values 
on record, the coefficient of linear correlation between the two measures 
was .93 in one instance, .98 to .99 in four cases, and .99 or over in the 
remaining six. The implications of these results for purposes of estima- 
tion are obvious. Perhaps equally interesting is their bearing on tra- 
ditional methodology. Judging from the past, at least, our substantive 
knowledge of movements in the gross reproduction rate would have 
been very nearly the same, had it been necessary to rely on the birth 
rate alone. It is slight exaggeration to say that the theory underlying 
the gross rate has been less important as a source of new empirical 
findings than as a basis for reinterpreting the trend of the birth rate.'° 

2. The net reproduction rate. Since changes in this measure reflect 
variations in both fertility and mortality, the crude rate of natural 
increase (birth rate minus death rate) suggests itself most immediately 
as the appropriate counterpart. The evidence is convincing, however, 
that this is much too simple a view. The present data permitted a 
pairing of some 430 changes between successive values of the two rates; 
most of the variations were over short intervals, averaging under 5 
years. With respect to direction, the changes are quite similar. Diver- 
gent movements were encountered in only 1 of 5 instances and often 
involved comparatively small changes on the part of both measures. 
Moreover such movements would have been much less frequent, had 
the published net rates been more accurate. In many cases these were 
computed on the basis of non-current mortality measures (the L; of 
footnote 2) ; as a result, the published rate was often declining when the 
true rate was in fact rising along with the crude rate. Examination of 
the paired variations over 30-year periods, which were much less 
affected by these distortions, showed a considerably lower proportion 
of divergencies, 1 in 8. 

The crude rate of increase seems even more appropriate for judging 
whether the net rate is above or below unity. As observed earlier, 





10 The size of the above correlations and the small differences between the paired short-run changes 
discussed previously are by no means unrelated. This can be suggested most easily by assuming the birth 
and reproduction rates are differentiable functions of time, say z(t) and y(t), respectively. Since y =O 
when z =0 (i.e., when no births occur), perfect correlation between the two measures would imply 

dy y dy dz 
y=cz, —=e=—», and — =—» 
dz z y'dt zx-‘dt 
or zero differences between the paired percentage variations over time. Conversely, the last condition 
implies perfect correlation, with the regression line passing through the origin. In approximate terms, 
therefore, very high correlations between the two measures are necessary and sufficient conditions for the 
expectation that their short-run percentage changes will be closely similar. 
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this question has attracted especially widespread interest in the past, 
Of the approximately 160 net rates below unity in the present com- 
pilation, 60 per cent were paired with a crude rate whose absolute value 
was below 5 per 1,000, and 94 per cent with one below 8 per 1,000. 
And conversely, 5 out of 6 crude rates below 5 per 1,000 and 3 of 4 
below 8 per:1,000 were paired with fractional net rates. 

On the other hand, crude rates of increase are likely to be of little 
or no value for estimating magnitudes of change. The distribution of 
absolute differences between the 430 paired changes showed a mean of 
51 percentage points and a median of nearly 9 points. The major ex- 
planation of the huge mean difference was that many of the crude 
rates were close to zero. In such instances even small absolute changes 
could represent tremendous relative movements, often amounting to 
several hundred per cent. Thus the mean relative change in the crude 
rate was over 55 per cent, in contrast to the average of only 6 per cent 
for the much more stable net rate. After eliminating all crude rates 
under 5 per 1,000, it was found that the remaining paired changes 
had a mean difference of about 10 percentage points; the median also 
dropped appreciably, to somewhat less than 6 points. As a practical 
matter, however, it is doubtful that even this restricted use of the 
crude rate has much value, since the differences to be expected appear 
far too high relative to average changes in the net rate. And in any 
event, low rates of natural increase, with a resulting instability in their 
percentage variations, will continue to occur frequently in the future. 

Accordingly, several attempts were made to pair the changes in the 
net rate with more stable estimators. As a first approximation, use 
was made of the birth rate, although it could be anticipated that the 
estimates would be biased. Mortality trends being very persistently 
downward among the populations in the sample, the net rates tend to 
rise more or fall less than if affected by fertility alone.'! As a result, 
the percentage variations in the birth rate tend to be algebraically 
lower than those in the net rate. This is brought out clearly by Figure 2. 
Among the paired changes moving in the same direction (first and third 
quadrants), the movement in the birth rate was more negative or less 
positive in the large majority of instances. Similarly, two thirds of the 
divergent movements involved a rise in the net rate and a fall in 
the birth rate. 





1 Any decline in mortality before the end of the childbearing period of life has the effect of raising 
the net rate; any rise lowers it. In contrast, the effects of mortality trends on the birth rate tend to be 
both indirect and delayed, since they operate mainly through changes in the sex-age distribution of the 
population. 
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Nevertheless, the association between the two sets of variations is 
striking on the whole, as can also be seen from the middle column in 
Table 1. Nearly half of the paired changes differed by less than 1 per- 
centage point and two thirds by less than 2 points. Errors of these 
magnitudes are well within the tolerance limits required for most pur- 
poses of estimation. In specific problems, moreover, the likely direction 
of error can often be taken into account. Thus if the birth rate is rising 
and the death rate falling, it will generally be safe to assume that the 


Fic. 2.—Tabulation of matched percentage changes in the net reproduction 
rate and crude birth rate.* 


5 


Percentage change in net reproduction rate CY) 


-19 





Percentage change in crude birth rate (X) 


* Using changes between all successive periods on record in 27 countries. Changes of over 15 per 
cent in either measure occurred in about 1 of 10 cases. Estimating equation (all changes, ungrouped 
data): Y =1.33+1.05 X 
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percentage change in the former measure is below the change in the 
net rate. The frequency of divergent movements was about 1 in 8 
cases and the proportion falls to 1 in 10 after eliminating the instances 
in which one of the paired measures showed no change at all. In addi- 
tion, many of the divergencies involve unusually small variations in 
both measures (see Figure 2). 

These conclusions should be interpreted in the light of the fact that 
most of the changes refer to short-term intervals, between 1 and 5 
years. Table 1 makes it clear that the usefulness of the birth rate as 
an estimator declines rather rapidly as the inteivals lengthen. The 
above errors for the total sample are appreciably smaller than the 
ones shown for the subsample of changes occurring before 1900, which 
generally cover 5- or 10-year periods. In turn, both of these sets of 
errors are well below the differences that seem indicated for 30-year 
periods. These patterns are not accidental, the basic explanation being 
the role of mortality. The longer the interval of change, the greater is 
likely to be the extent to which the net reproduction rate has been 
affected by mortality movements. The birth rate being comparatively 
insensitive to such trends, the two measures can be expected to show 
increasingly dissimilar variations with the passage of time.*? 

Somewhat surprisingly, no simple combination of birth and death- 
rate changes could be found which would overcome these difficulties. 
Three combinations were investigated, the one which in general yielded 
the smallest errors being described in the last column of Table 1." 
It will be observed that the birth rate alone seems a much more service- 
able estimator for short-run purposes. Neither the birth rate alone nor 
any simple combination with the death rate stands out as clearly 
superior in the pre-1900 changes, a result which may be attributable 
to the small number of cases. Only for the 30-year changes is there any 
strong suggestion that simple methods of allowing for mortality may 





12 So far as the populations of the West are concerned, it can also be expected that the future 
effects of mortality trends will be much smaller than in the past. Current mortality before age 45 is so 
low in these areas that the maximum effects still possible from further declines are limited. 

13 Letting B and D denote the percentage changes of the birth and death rates, respectively, the 
three combinations were 


(100 + B)(100 — D)] — 10,000 100+B 
100 100 + D 





xX 100 — 100, and B-D. 


The first of these is the one whose pairings with the net rate are described in Table 1. It will be noted 
that each combination reduces to B when D =0; also, each yields a higher value than B when D is 
negative. 

“4 Examination of all the paired 5-year changes that could be derived from the available materials 
yielded these results: an average change in the net rate of 13 per cent, a median error of 2.7 per cent 
resulting from the use of the birth rate alone, and a median error of 5.2 per cent from the combination 
discussed in the text. The corresponding values for all possible 10-year changes were 17, 4.7, and 5.3. 
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lead to sensibly smaller errors, and even here there are contrary indi- 
cations. Considering that the average 30-year change in the net rate 
was 15-16 per cent, the errors to be expected from all of the estimators 
examined here are probably too high for most purposes. 

At the same time, it should be added that the present evidence is of 
a very limited sort. Much smaller errors might be found by combining 
the changes in birth and death rates in a less simple manner, for ex- 
ample by differential weighting. The important analytical lead in this 
regard may be that, although the net rate tends to vary in fairly 
proportionate manner to a given change in the birth rate, its relation 
to the death rate is much more complicated. Pending actual investiga- 
tions, however, it is impossible to predict the kinds of errors that might 
be expected from a more sophisticated approach. 

Much more can be said about the potential usefulness of crude vital 
measures in a second connection, i.e., for inferring the over-all charac- 
teristics of time series of the net rate. Among the eight countries for 
which 20 or more values were available, the lowest coefficients of linear 
correlation between this measure and the crude rate of natural in- 
crease were .83, .87, and .91; in the remaining five areas, the coefficients 
were all between .95 and .98. The birth rate for the same countries 
yielded coefficients of .66, .88, .93, and in five instances, .95 to .99. 
The low value of .66 is for France, where a succession of wars over the 
last century has led to unusual imbalances in the sex-age distribution 
of the population. In general, therefore, the conclusions reached earlier 
about the gross reproduction rate also seem appropriate here, though 
to a lesser extent. As a contribution to time series analysis of population 
replacement, the theory underlying the use of net rates has been pri- 
marily fruitful along conceptual lines. Given the theory, much the 
same factual interpretations would have been forthcoming from other 
empirical sources. 

In summary, the results of this section suggest the following: (1) 
The crude rate of natural increase is probably of little practical value 
for estimating single or occasional percentage changes in the net 
reproduction rate. (2) On the other hand, extended time series of the 
former measure provide reliable information on at least the salient 
time-series characteristics of the latter. (3) The crude rate of increase 
also permits confident inferences on whether the net rate is close to or 
below unity and whether the net rate is rising or falling. (4) For esti- 
mating short-run percentage variations, not exceeding a decade, it is 
likely that changes in the birth and net rates are closely enough associ- 
ated to be substitutable for most practical purposes. Perhaps the main 
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reservation here is that many of the world’s statistically underde- 
veloped populations may experience very rapid mortality declines 
during the next few decades; the postwar declines found in numerous 
parts of Latin America and Asia are almost without precedent in the 
earlier vital history of Western nations. (5) No simple combination of 
birth and death rates is likely to yield better short-run estimates than 
the birth rate alone. (6) None of the estimators examined here seems 
satisfactory for dealing with changes in the net rate over intervals ex- 
ceeding a decade, unless very crude indications are sufficient. On the 
other hand, further investigation may well reveal combinations of 
birth and death rates which are more serviceable. 

Obviously, the prospects that crude vital measures can provide use- 
ful inferences about the net reproduction rate are mixed. There is 
little question, however, that such inferences will often be possible in 
the kinds of situations discussed earlier in connection with the gross 
rate. 


III. SPATIAL VARIATIONS 


The usefulness of crude vital measures for inferring contemporary 
international differences in gross and net rates has been examined for 
three periods, centering on 1933, 1937, and 1947. Ideally, the question 
should have been reviewed for a large number of periods and for signifi- 
cant sub-groupings of areas. The paucity of data was such, however, 
that only recent years could be used at all; even then the number of 
areas for which documentation exists at any one time is small and 
non-Western populations are badly underrepresented. In each year 
correlation coefficients were used as indexes of the association between 
crude and reproduction measures. 

Two aspects of the testing procedure should be noted. First, the 
areas selected for 1933 and 1937 are practically identical. This was 
done intentionally, a few countries being rejected which were docu- 
mented for one year but not the other. Thus the results may be inter- 
preted as the outcome of a crude replication experiment; the relatively 
local changes in each area between the two years correspond to sample- 
to-sample fluctuations and the inclusion of the same areas corresponds 
to repeated sampling from the same universe. Secondly, all of the 
countries for which the needed data were available were included for 
1947. About a third of the 1947 areas are not in the earlier samples 
while about a quarter of the areas in the early years do not appear in the 
later year. In view of these variations and the substantial differences 
between economic and demographic conditions before and after the 
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war, the results presented in Table 2 reflect about as much year-to- 
year variability as can be obtained. 

As with temporal variations, the spatial associations between birth 
and gross reproduction rates appear to be very close. Whether measured 
in rank order or cardinal terms, the correlations suggested by the 
present data are well over .9 and highly similar from period to period. 
If the inadequacies in coverage are not too serious, it seems clear that 
the birth rate can be confidently applied to two kinds of problems: 
(1) the reconstruction of entire comparative patterns, when direct 
information on the gross rate is lacking for most of a group of countries 
under study, and (2) the determination of the comparative status of 
individual populations, in relation to areas which provide such infor- 
mation. 

These conclusions are brought out in a more analytical way by Table 
3, which presents selected data on sex-age composition and age-specific 
fertility (corresponding to the f; and r; of footnote 1). The five popula- 
lations for which recent fertility rates are given span a broad range of 
demographic situations, while the earlier materials for Sweden and 
the Netherlands suggest the compositional changes that may take 


TABLE 2 


CORRELATION OVER SPACE BETWEEN REPRODUCTION 
RATES AND CRUDE VITAL RATES* 











No. of Coefficient of Coefficient of 
Correlates a rank order linear 
countries . . 
correlation correlation 
GRR and BR 
1933 18 .93 .95 
1937 18 .93 .98 
1947 20 .94 .97 
NRR and CRNI 
1933 17 .89 .88 
1937 18 .83 .73 
1947 20 .83 .88 
NRR and BR 
1933 17 .82 .78 
1937 18 .84 .76 
1947 20 .74 .76 





* BR =Crude birth rate; CRNI =Crude rate of natural increase; GRR =Gross reproduction rate; 
NRR =Net reproduction rate. 
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place over periods of very marked vital trends. It will be observed 
that the variations in sex-age proportions are considerably smaller 
than those in fertility. Thus the latter measures, which completely 
determine the gross reproduction rate, also dominate the birth rate. 

The expectation that the two rates will show similar patterns over 
time or space follows from these relations. Table 4 shows the birth 
rates resulting from all possible combinations of the sets of age-specific 
fertility rates and sex-age proportions in Table 3. The differences 
within each column, reflecting the influence of variable age composition, 
are usually far below the differences between the actual birth rates, 
There is no overlap at all between the ranges of values in the first four 
columns and five of the seven values in column 5 exceed the highest 
in column 4. In other words, a random selection of five values from the 
various columns would show the same or practically the same ordering 
of birth rates as was in fact the case. In turn, the ordering of gross rates 
indicated by such a selection would almost certainly correspond to the 
one suggested by the actual birth rates, which is correct. 


TABLE 3 


RATIOS OF NUMBERS OF FEMALES IN REPRODUCTIVE AGES 
TO TOTAL POPULATION AND AGE-SPECIFIC FERTILITY RATES, 
FOR SELECTED COUNTRIES AND PERIODS 








Country, al 


year 





15-19 20-24 25-29 30-34 35-39 40-44 45-49 





1. Ratio of females to total population (in per cent) 
Sweden 
1900 4.62 ’ 3.50 3.07 3.20 
1950 2.92 ; 3.75 3.81 3.82 


Netherlands 
1909 4.72 ‘ . .61 
1949 3.98 , , .40 
Japan, 1950 5.10 ; ; 41 
Venezuela, 1950 5.01 ; ; 13 
El Salvador, 1950 5.48 ; : .09 





2. Age-specific fertility rates (per 1,000 women) 
Sweden, 1950 38.9 126.0 128.0 93.6 54.9 
Netherlands, 1949 12.7 97.0 187.8 172.0 120.2 
Japan, 1950 13.3 160.3 236.9 174.8 104.2 
Venezuela, 1950 104.4 276.4 277.5 217.3 144.7 
El Salvador, 1950 120.0 296.0 320.2 249.4 146.0 
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TABLE 4 


BIRTH RATES UNDER VARYING COMBINATIONS OF SEX-AGE 
PROPORTIONS AND AGE-SPECIFIC FERTILITY RATES, 
FOR SELECTED COUNTRIES AND PERIODS* 








Age-specific fertility rates of: 





Sex-age 


- . 3 " 
proportions Seeden, Nether. Vene- El Sal 


of 1950 


lands, —_— ; zuela, vador, 


1949 1950 1950 





Sweden 
1900 : 25.3 
1950 e 26.6 


Netherlands 
1909 o- , vt ee 
1949 ; j 21 


Japan, 1950 ‘ F 28.1t 43.5 48.1 
Venezuela, 1950 ‘ ; 26.9 41.7f 46.3 
El Salvador, 1950 ; P 27.8 43.5 48 .2f 
Gross reproduction rate ; ‘ 1.76 2.66f 2.95f 





* Each figure is the sum of the products obtained by cross-multiplying the indicated row in the 
first part of Table 3 by the indicated row in the second part. 

+ Differs somewhat from actual birth rate. 

¢ Estimated, but must be very nearly accurate; no official figure available. 


Once again, the advisability of relying on similar information in the 
case of the net rate is more uncertain. The associations of this measure 
with either the crude rate of natural increase or the birth rate are 
notably lower, while the year-to-year variations tend to be larger. An 
added complication is the questionable quality of some of the data; 
the numbers of cases are so small that even a few inaccuracies may 
have substantiel effects on the indicated results. Fortunately the 
errors resulting from this factor appear to have operated mainly to 
reduce the apparent association. On the other hand, the populations 
whose data are most suspect are also the ones which represent demo- 
graphic situations outside the West. Thus, whether crude rates can be 
usefully applied to comparing net rates in the West and elsewhere 
must remain a moot point in good part. 

Nevertheless, two positive conclusions seem rather clearly indicated. 





% With only one country, Chile, removed from the 1947 data, the rank order coefficient between 
net and birth rates rises from .74 to .87 and the linear coefficient from .76 to .92. Had the net rate for 
that country been based on current mortality, the 1947 coefficients in Table 2 would have been inter- 
mediate between these values. 
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On the average and without distinction among sub-groupings of popu- 
lations, the use of crude rates for judging international variations in 
the net reproduction rate will generally rest on correlation coefficients 
of about .8 or above. These are very high levels of association by 
usual standards in social studies. Secondly, the coefficients between 
birth and net rates in Table 2 probably understate the associations 
that can be counted on in dealing with Western populations alone. 
Mortality differentials within this region are already small and seem 
certain to become smaller. For this reason alone, it appears likely that 
coefficients below .8 will be unusual in the future. In addition, current 
mortality before age 45 in nearly all of the West is so low that future 
declines will leave only slight room for differences between gross and 
net rates. In the years to come, therefore, correlations between birth 
and net rates should increasingly approach the much higher associ- 
ations in evidence today between birth and gross rates, 


Iv. CONCLUSIONS 


In a number of ways, empirical use of gross and net reproduction has 
represented something of a shotgun methodology: (1) Reproduction- 
rate theory has really been concerned with the experience of a given 
time and place. As applied, however, the theory has been assumed to 
demonstrate that the disparities between crude and reproduction 
rates are equally significant for the very different purposes of making 
spatial or temporal comparisons. (2) Most of the theory has in fact 
centered about the advantages of the net reproduction rate. The tend- 
ency in empirical work, on the other hand, has been to proceed as if the 
break with crude vital measures is similarly marked in the case of the 
gross rate. Whether this is so, and what the dimensions of a significant 
break might be, are questions which have been answered more by 
default than by design. (3) The frequency with which reproduction 
rates have been computed to the third decimal place would make it 
appear that they are to be used in very precise ways. In contrast, the 
examples most stressed by the theory have been of an ordinal kind, 
or situations in which the net rate appeared to point to imminent 
population decline and the crude rate of natural increase to continuing 
growth. 

The results presented earlier bring out the relevance of these dis- 
tinctions. There can be little question that the development of repro- 
duction-rate theory has fostered a richer conceptual interpretation of 
demographic events than was previously possible. But it also seems 
clear that the specific factual contents of such interpretations have 
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often been derivable, with sufficient precision, from much simpler 
vital measures. In particular, undue emphasis seems to have been 
placed on the contrast between positive current replacement, as indi- 
cated by current natural increase, and negative eventual replacement, 
as suggested by fractional values of the net reproduction rate. Below- 
replacement values of the net rate have far more often than not been 
accompanied by current rates of increase which are unusually low by 
historical standards. And conversely, the large majority of such current 
rates could have been used to infer that the net rate was below unity. 
The net rate measures eventual population change only on the as- 
sumption that age-specific fertility and mortality remain the same 
over lengthy periods, usually two generations at least. Given the same 
assumption, the onset of negative crude rates of increase could be 
deduced from much shorter projections of current behavior into the 
future. 

Issues in methodology as broad as the ones raised here can never, 
of course, be resolved definitely. Two reservations in particular need 
emphasis in interpreting the previous conclusions. First, any judgment 
as to what may or may not constitute significant errors is inevitably 
subjective in some degree. The criteria used above seem pertinent to 
the bulk of demographic measurement but others could plausibly be 
advanced, at least for some purposes. The second and more important 
point is the underrepresentation of non-Western materials in the pres- 
ent evidence. The nineteenth-century experience on record serves as a 
partial substitute for contemporary non-Western situations; this is 
one of the reasons why it has been singled out for special attention. 
Nevertheless, this experience relates predominantly to Western Europe 
and it is advisable to exercise caution in applying the above findings 
to the statistically underdeveloped areas of today. 
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DIFFERENTIAL MORTALITY, GENERAL AND CAUSE- 
SPECIFIC IN BUFFALO, 1939-41 


CoONSTANTINE A. YERACARIS 
The University of Buffalo 


INTRODUCTION 


EMOGRAPHIC phenomena provide the social scientist with relatively 
D precise and objective data. Research shows such phenomena 
closely related to social and cultural factors. Age, nativity, race, sex 
and socio-economic status relate to mortality [2, 5] the relation varying 
with the cause of death [1, 8, 9]. 

This paper presents further evidence of variation of demographic 
phenomena with socio-economic status. It summarizes highlights of a 
study [12] undertaken to combine and retest previous methodological 
and substantive findings concerning mortality in a metropolis. It pre~ 
sents for the city of Buffalo, New York: (1) differentials in mortality 
from all causes; (2) cause-specific mortality differentials, and (3) infant 
mortality differentials. 


PROCEDURES AND METHODS 


Data on deaths in 1939-41 of Buffalo residents were secured from 
records of the Division of Vital Statistics of Buffalo and from records 
of the New York State Health Department. Death rates were com- 
puted by averaging the number of all age-specific deaths of Buffalo 
residents for the three-year period and relating them to the age, sex, 
race and nativity composition of the population of Buffalo as reported 
in the 1940 Population Census. Infant mortality rates were based on 
the three-year average number of deaths of those under one year of 
age applied to the average number of resident births for the same 
period, adjusted to the year 1938. 

Crude, standardized, and age-specific death rates and life expec- 
tancies have been basic tools of this mortality analysis [12]. In addition, 
for a simpler summarizing measure of socio-economic differentials in 
mortality, the simple, general, and proportional-cause-specific indexes 
of differential mortality were used. These indexes are a modification of 
Westergaard’s “method of expected cases” [11]. The simple index 
measures the proportion of total deaths (from all or specific causes) 
within a given economic group in “excess” because of differences in 
mortality rates between this group and the highest economic group, 
with age held constant between these groups. It is the ratio of the 
difference between observed and expected deaths in this given group 
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(with age-specific mortality rates of the highest group applied to the 
age composition of this group) and the total observed deaths. For ex- 
ample, the simple index of Group II is given by 


i) 
> mJn! 
a=0 


1— 





> men! 

a=0 
where the superscripts denote socio-economic groups, the subscripts 
denote age groups, m denotes mortality rates, and n denotes the number 
in an age group. 

The general index measures the proportion of all deaths (from all or 
specific causes) of the population as a whole which would have been 
“saved” if the age-specific mortality rates of the group with the lowest 
mortality prevailed in all groups. 

The proportional cause-specific index measures the proportion of 
deaths from all causes that would have been “saved” if there were no 
economic differences in mortality from a specific cause. It is the ratio 
of the difference between the observed and the expected deaths from 
a given cause (with the death rates from the given cause within the 
highest economic group prevailing throughout the age composition of 
all economic groups) to the total observed deaths from all causes in 
all groups. 

The five socio-economic groups for which rates were presented were 
obtained by classifying the 72 census tracts in the city according to 
the median (contract or estimated) rental of residences in the tracts. 
The method was originally used by Hauser [3] and later utilized by 
Convis [1], Mayer [5] and Kitagawa [4] in their studies on mortality 
and fertility differentials for Chicago. The original rental intervals for 
Chicago (1930) were: Group I (highest), $75.00 and over; II, $60.00 
to $74.00; IIT, $45.00 to $59.00; IV, $30.00 to $44.00, and V (lowest), 
under $30.00. 

Since these rental intervals were not applicable to Buffalo for 1940, 
the 1940 population of Buffalo was divided into five groups in as 
nearly as possible the same percentage distribution as the Chicago 
intervals. The 1940 census figures for co..tract or estimated median 
rentals provided the following socio-economic intervals for Buffalo: 
Group I (highest) $42.50 and over; II, $31.90 to $42.00; III, $24.70 
to $31.89; $20.00 to $24.69; and V (lowest), under $20.00. The numbers 
of deaths or births were allocated to these five economic groups on the 
basis of the residence address of the deceased or of the mother of the 
newborn child. 
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DIFFERENTIALS IN MORTALITY FROM ALL CAUSES 


The general differences in age-specific mortality rates were found to 
be similar to those established in the literature. The following conclu- 
sions may be drawn from an examination of these rates: 

(1) Generally the death rate for females for the city as a whole is 
lower than the death rate for males. This is true for native whites and 
Negroes and for all groups with the exception of the foreign born in 
certain economic groups (Table 1). 

(2) There is an inverse relationship between socio-economic status 
and mortality rates standardized for age, for both sexes, for Buffalo, 
1940. (Table 1). 

(3) Of the total deaths in Buffalo in 1940 (Table 2), 19.1 per cent 
would not have taken place if all economic groups had the same age- 
specific death rates as the highest economic group (general index). 
Death “wastage” due to socio-economic differenees was greater for 


TABLE 1 


TOTAL MORTALITY RATES BY SEX, NATIVITY, COLOR AND 
SOCIO-ECONOMIC GROUPS, BUFFALO, 1940; RATES 
STANDARDIZED FOR AGE 


(Per 1,000) 








Socio- Total Native Foreign 


Be eset White White White 
roup 





Male 





All groups 

I (high) 
II 
II 
IV 
V 








All groups 
I (high) 
II 

III . 
IV 10. 
V 12. 


1 
1 


10. 15.9 























*In this and subsequent tables, in addition to total rates for Negroes, only Group V rates for 
Negroes are presented inasmuch as the numbers in the other groups were too small to yield stable rates. 








1238 AMERICAN STATISTICAL ASSOCIATION JOURNAL, DECEMBER 1955 


females than for males. The proportion of deaths which would not have | 


occurred if there were no mortality differences due to economic status 
increased inversely with the status group (simple indexes). 

(4) Life expectation at birth for both sexes was inversely related to 
economic status (Table 3). In comparable ages, the life expectation 
of native whites, both male and female, was higher than that of foreign 
whites. The life expectation at birth for Negro males of the lowest 
economic group was more than six years lower than that of native 
white males of the same economic group and more than twelve years 
lower than that of native white males of the highest group. Expectation 
of life at birth for Negro females of the lowest economic group was 
eight years lower than that of native white females of the same group 
and about fifteen years lower than that of native white females of the 
highest economic group. 


CAUSE-SPECIFIC MORTALITY DIFFERENTIALS 


The foregoing analysis of mortality rates by socio-economic groups 
was confined to data on deaths from all causes. It would be of great 
value to investigate which specific causes of death are more or less 
“sensitive” to socio-economic differentiation. Cause-specific mortality 
analysis from a sociological point of view constitutes a method of 
refining general differences and helps clarify some of the more intangible 
socio-psychological and biological factors involved in demographic 
phenomena. 

The cause-specific analysis to follow is based on the assumption that 


TABLE 2 


SIMPLE AND GENERAL INDEXES OF DIFFERENTIAL 
MORTALITY; BY SEX, BUFFALO, 1940 











(Per 1000) 
Economic Total 
Group Population Mabe Female 
General Indexes 
All Groups 19.1 17.6 20.8 


Simple Indexes 


I (high) 0.0 0.0 0.0 
Il 6.8 2.3 11.1 
Ill 17.3 15.9 18.8 
IV 24.5 21.4 28.0 
V 38.5 36.7 41.7 
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the clinical diagnosis of the cause of death as reported on death certifi- 
cates is accurate. Considerable literature [7] indicates that significant 
errors are involved in such diagnoses. All conclusions in this study 
must, therefore, be considered with this in mind. In addition, because 
of the relatively small number of deaths, the multiple cross-classifi- 
cation of the study has in some instances resulted in small frequencies. 
This introduces a serious problem of instability in the resulting rates. 
However, analysis of mortality data in the study revealed patterns 
similar to those with more stable rates established in the literature. 

The selected causes of death [10] have shown an over-all relationship 
with economic status differentiation (Tables 4 and 5). In proceeding 
with the study of the nature and significance of this relationship, it 
must be stressed that any interpretation is somewhat impaired by the 
multifactorial determinant of mortality. Lack of early diagnosis, 
exposure, inadequacy of nutrition, lack of therapeutic knowledge and 
other cultural and socio-economic factors are found for different causes 
of death to be important in differing degrees. 


TABLE 3 


LIFE EXPECTATION AT BIRTH AND AGE 30; BY SEX, RACE, 
NATIVITY AND ECONOMIC GROUP, BUFFALO, 1940 



































Total Native Foreign _——e 
; Population White White 6 
Economic 
Group At Age At Age Age At Age 
Birth 30 Birth 30 30 Birth | 30 
Males Mi 
All groups 62.9 | 37.6] 63.5 | 38.2 37.5 54.5, | 30.8 
I (high) 65.7 | 39.8} 66.1 | 40.1 39.0 —- |< 
II 65.5 | 39.9] 65.3 | 39.7 40.5 —‘|— 
III 63.4 | 38.1] 63.6 | 38.2 37.8 — — 
IV 62.2 | 37.1} 61.6 | 36.5 38.3 -— — 
V 58.2 | 34.1] 59.7 | 35.4 35.0 53.6 | 30.4 
Females 
All groups 66.1 | 40.5 | 65.0 | 39.1 38.3 56.9 | 34.8 
I (high) 69.6 | 43.3} 70.3 | 44.4 40.6 — — 
II 68.3 | 41.8} 68.8 | 42.4 40.8 — — 
III 66.4 | 40.7| 66.8 | 41.1 40.0 — 
IV 64.8 | 39.1 | 65.6 | 39.8 38.7 — oo 
V 61.8 | 37.1] 64.1 | 38.8 36.9 55.8 | 34.1 
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The magnitude of the general indexes of differential mortality indj- 
cates that economic differences in deaths from tuberculosis, respiratory 
diseases, heart diseases and intracranial lesions are inversely related to 
socio-economic status. The rank ordering of the observed differentials 
coincided also with the degree of complexity of the cause of deaths as 
measured by the reduction of mortality from such diseases experienced 
within the past decades. On the basis of the recorded resident deaths 


TABLE 4 


SIMPLE AND GENERAL INDEXES* OF DIFFERENTIAL 
MORTALITY; BY SEX AND SPECIFIC CAUSES 
OF DEATH, BUFFALO, 1940 


(Per 1,000) 








' ° = : 
Die Intra- Heart Reapir- 





















































Economic All Tuber- Gute betes ial Dis- atory Nephri- Other 
Group Causest | culosis Melli- , Dis- tis . 
Lesions | eases Causes 
tus eases 
General Index—Total Population 
All Groups | 19.1 | 59.3 | 10.0 | 31.9 | 13.3 | 13.6 | 26.2 | 7.6 | 26.3 
Simpie Inder—Total Population 
=m cQetic 
II 6.8 25.7 7.9 23.5 2.9 2.1 —3.9 | 11.1 9.9 
III 17.3 52.6 8.5 34.4 12.0 11.6 23.0 18.8 22.5 
IV 24.5 62.3 10.8 41.5 18.2 ae 36.5 19.1 31.1 
Vv | 38.5 81.4 21.5 43.2 | 31.1 33.3 50.7 —58.8 47.5 
General Indexr— Male 
| | 
All Groups | 17.6 67.7 | 17.9 | —7.6 | 7.3 | 10.7 | 22.9 | —2.2 | 19.8 
Simple Indexr— Male 
ll 2.3 | 40.9 8.1 |-17.6 0.0 0.0 |-18.5 | 4.8 3.2 
III 15.9 61.8 19.2 4.3 v8 9.1 16.7 11.1 20.9 
IV 21.4 69.2 23.3 |-—13.3 4.5 9.9 36.2 9.3 29.7 
¥ 36.4 84.8 29.1 |-—15.4 21.7 28.8 46.8 |-—100.0 46.8 
General Index—Female 
All Groups | 20.8 | 44.8 | 2.6 | 51.9 | 18.3 | 17.0 | 31.1 | 16.2 | 27.8 
Simple Index—Female 
il 11.1 | 00 | 7.8 | 44.1 | 5.2 | 4.6 | 12.5 16.7 | 17.1 
Ill 18.8 39.1 -1.8 51.2 15.9 14.4 31.2 25.4 24.4 
IV 28.0 50.0 —3.9 63 .2 29.1 25.7 37.0 28.3 33.1 
Vv 41.7 74.2 11.1 67.7 40.9 | 25.4 57.7 —22.2 48.7 





* Economic Group I as the basis. 
t Includes infant mortality. 
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between 1900 and 1940 in the State of New York [6], for example, tu- 
berculosis, with the most spectacular state-wide reduction in mortal- 
ity, shows in this study the highest economic difference (Table 6). 
Economic differentials were less pronounced in mortality from heart 
diseases, intracranial lesions, cancer and nephritis. The observed 
relationship was curvilinear as compared to the linear relationship for 
the other causes (Table 4). Such a relationship, barring possible errors 


TABLE 5 


PROPORTIONAL CAUSE-SPECIFIC INDEXES OF DIFFERENTIAL 
MORTALITY; BY SEX, BUFFALO, 1940 


























(Per 1,000) 
Total 
Cause of Death Populati -™ Male Female 

Tuberculosis 24.8 34.0 14.6 
Cancer 13.4 21.9 3.7 
Diabetes Mellitus 12.0 —1.8 27.5 
Intra-cranial Lesions 10.2 4.8 16.3 
Heart Diseases 47.0 88.5 56.6 
Respiratory Diseases 12.6 12.3 12.9 
Nephritis 4.8 —1.2 11.5 
All other causes | 66.6 67.9 65.1 
Total | 191.4 | 176.4 208.2 

TABLE 6 


GENERAL INDEXES OF DIFFERENTIAL MORTALITY; 
BY CAUSE OF DEATH, BUFFALO, 1940 

















(Per 1,000) 
Total Population 
Causes of Death 
Index* Ratiof 
Tuberculosis 59.3 7.8 
Diabetes Mellitus 31.9 4.2 
All other causes 27.3 3.6 
Respiratory Diseases 26.2 3.4 
Heart Diseases 13.6 1.8 
Intracranial lesions 13.3 1.8 
Cancer 10.0 1.3 
Nephritis 7.6 1.0 








* Index per 1,000 deaths. 
t Ratio of each index to smallest. 
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in death certification, might well be due to the economic power of the 
upper economic group and the available free medical care for the 
lower group, with the “in-between” groups having to bear their burden 
with their own means. 


INFANT -MORTALITY DIFFERENTIALS 


In spite of medical and social progress, the first year of life still shows 
the greatest death toll of any single year. It is important to locate the 
areas with need for improvement and to find those biological or socio- 
economic factors which appear to be associated with mortality during 
this period. 

To offset problems of rate stability caused by the small number of 
deaths for each of the five economic groups, Groups I and II were 
combined, Group III remained intact, and Groups IV and V were 
combined. 

The following may be stated on the basis of the study’s findings: 

(1) Mortality rate of infants, neonatal and one to eleven months 
old, is greater for females than for males (Table 7). 

(2) There is an inverse relationship between infant mortality and 
economic status. Economic differences are present even during the 
first day after birth (for males) and become more pronounced as age 
at death of infants increases (Table 7). 


TABLE 7 


INFANT MORTALITY RATES BY SEX, RACE, AGE AND 
ECONOMIC GROUPS, BUFFALO, 1940 


(Per 100,000) 




















Under Under Total under 
Economic Group 1 day 1-20 days 1 month 1-11 months 1 year 
and Sex 
White} Negro | White] Negro | White} Negro | White] Negro | White} Negro 
Groups I and II 
Male 1519 1291 2810 630 3440 
Female 1309 855 2164 611t 2775 
Group III 
Male 1624 1624 3248 1052 4300 
Female 1103 1164 2267 1042 3309 
Groups IV and V 
Male 1574 1916t| 1778 1342t| 3352 | 3258t| 1590 1724*| 4942 | 4982 
Female 1136 | 2565T| 1134 1099¢ | 2270 | 3664t| 1325 1139* | 3595 | 4803 
All Groups 
Male 1568 1916T| 1591 1341¢| 3159 | 3257T| 1174 1727* | 4333 | 4984 
Female 1178 | 2565t| 1004 | 1092¢| 2237 | 3663t| 1043 | 1098*| 3280] 4761 
































* Less than 10 instances of death. 
t Between 10 and 24 deaths. 





TABLE 8 
NEONATAL AND INFANT DEATHS AND MORTALITY RATES BY 









































































































































he SEX, RACE AND SPECIFIC CAUSE; BUFFALO, 1940 
he (Rates per 100,000) 
en ——_—— 
ba» Under Under 1 month 1-11 months 
year 
Race Male Female Male | Female Male Female 
WS 
he Deaths | Rates || Deaths | Rates || Deaths | Rates | Deaths | Rates || Deaths | Rates | Deaths | Rates 
0- All Causes 
ig ff 
Total 598 | 4350 443* | 3349 434 | 3157 303 | 2291 164 1193 140* | 1088 
White 572 | 4333 415 | 3280 417 | 3159 | 283 | 2237 155 1174 132 1043 
of Negro 26 | 4981 26 | 4762 17 | 3257 20 | 3663 9 1724 6 | 1099 
re Respiratory Diseases 
re 
Total 77 560 56* | 423 22 160 10 76 55 400 46* | 348 
White 72 545 50 395 21 160 10 79 51 385 40 316 
Negro 5 958 4 733 1 192 “= _ 4 766 4 733 
1s 
Diarrhea 
d Total 36 262 19 144 8 58 7 53 28 204 12 91 
ie White 36 273 19 150 8 61 7 55 28 212 12 95 
N pra pate oe a ae - _ aid a a ‘ont Ea 
e egro 
Congenital Malformations 
Total 74 538 72 544 51 371 45 340 23 167 27 204 
White 74 560 71 561 51 386 44 348 23 175 27 213 
Negro — _ 1 183 _ - 1 183 _ _ - _ 
Premature Birth 
- Total 207 1506 145 1096 204 1484 142 1074 3 22 3 22 
White 191 1447 129 1020 188 1424 126 996 3 22 3 24 
a Negro 16 | 3065 16 | 2931 16 | 3065 16 | 2931 -- — -- 7 
) 
© Injury at Birth 
Total 59 429 32 242 58 422 32 242 i 7 _ _ 
White 59 447 31 245 58 439 31 245 1 8 _ _— 
Negro - - 1 183 = _ 1 183 _ “= - — 
Other Diseases Peculiar to First Year of Life 
Total 67 487 52 393 63 458 49 370 4 3 22 
a White 67 508 50 395 63 477 47 372 4 31 3 24 
Negro _ _ 2 366 — _ 2 366 Be - o- 
All Other Causes 
wae 
Total 78 568 67 507 28 204 18 136 50 364 49 371 
White 73 553 65 514 28 212 18 142 45 341 47 371 
Negro 5 958 2 366 - _ _ _ 5 958 2 366 















































* Deaths are reported for the 1939-41 period. Two deaths of females of “other races” from respiratory diseases were 
not designated as such in this tabulation. 
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TABLE 9 


TOTAL INFANT MORTALITY RATES BY SEX, RACE AND 
ECONOMIC GROUP BY SELECTED CAUSES OF 


DEATH; BUFFALO, 1940 
(Rates per 100,000) 





































































































Diseases 
All of the , Congenital 
Economic Causes Respiratory Diarrhea Malformations 
Group System 
Male |Female} Male \Female Male | Female | Male | Female 
White 
I 3623 | 2865 362* 92* gi* 92* 725* | 832* 
II 3374 | 2743 387¢ | 338* 35* | 150* 492t | 564t 
III 4340 | 3310 443¢ | 4297] 236* 61* 591t | 368 
IV 4493 | 3413 595¢ | 471¢ | 352t] 235* 514¢t | 677+ 
V 5668 | 3870 922+ | 445+ | 599+] 178* 599¢ | 534t 
Negro 
v_ | 5495 | 5252 | 1099 | sao | — | —- | — | 210 
Other Diseases 
Economic Premature Injury at Peculiar to All Other 
Group Birth Birth First Year Causes 
| of Life 
Male | Female Male IFe male | Male | Female} Male | Female 
White 
I 1087¢| 1017+} 544* 462* 453* 185* 362* 185* 
II 1160 902 422t 263* 422t 263* 457t 263* 
III 1594 1165 413t 245* 561t 398T 502t 6447 
IV 1407 | 1000 460¢ | 177* | 487t | 441f 677¢ | 412T 
V 1843 9797} 4617 | 222* | 599 | 5S78t | 645t | 934f 
Negro 
I 
v | 3297* 3151* | — | 210+} — | 420* | 1099* | 420° 











* Less than 10 instances of death. 
t Between 10 and 24 deaths. 
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(3) Economic differentials among males! were more pronounced 
than among females in neonatal mortality (Table 7). 

(4) The incidence of Negro infants’ deaths was greater than that of 
white infants. 

(5) The most frequent causes of mortality at an age less than one 
month were premature birth, injury at birth, and congenital malforma- 
tions. Mortality from premature birth and from congenital malforma- 
tions has not been reduced as drastically as that from other causes. 
It was mortality from these two causes which showed no significant 
relationship with economic group (Tables 8 and 9). 

(6) For infant mortality after ¢\e first month of life, premature birth, 
injury at birth and congenital malformations gave way to other causes 
such as respiratory diseases and diarrhea. General mortality from such 
diseases has been drastically reduced during the past 50 years. In this 
study, it was mortality from these causes that was inversely related to 
economic status. 

It will be difficult to state precisely all those factors which influence 
mortality for this age group. Yerushalmy [13] has indicated that vari- 
ations in neonatal mortality are dependent upon congenital differences 
to a larger degree than are variations in adult or infant mortality as 
a whole. He shows that mortality varies with the age of the mother 
and order of birth and that there is a greater possibility for the first 
born to experience trauma and certain conditions of congenital mal- 
formations. If these results are relevant in the present study, it may 
be that such congenital factors as order of birth, age of parents, etc., 
offset economic inequalities. This hypothesis would to a certain extent 
explain the absence of any relationship between economic status and 
mortality from congenital malformations and permature birth. 

Similarly, the existence of some direct relationship between economic 
status and mortality from injury at birth could be explained by the 
hypothesis that there was in the period 1939-41 a direct relationship 
between the proportion of first order births and economic status and 
that consequently the liability of a first born injury was directly related 
to economic status. Another hypothesis is that the better the economic 
status of the mother, the more likely is the use of instrumentation 
in childbirth as contrasted with the more prevalent use of spontaneous 
childbirth by the lower status groups, and that consequently there is 
a greater probability of injury at birth for the better economic status 


groups. 
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SUMMARY 


Socio-economic status, as measured by median monthly rentals, was 
shown to be inversely related to mortality in the city of Buffalo for 
1939-41. Such a relationship was reflected in striking differences jn 
expectation of life among various groups, and in the proportion of 
“wasted” deaths due to differences in mortality rates between the high- 
est economic group and the others. The inverse relationship was present 
even for mortality of infants under one day and was more pronounced 
for deaths at greater ages. 

Cause-specific mortality analvsis demonstrated that mortality due 
to selected causes, which have been shown to be related to environ- 
mental conditions and for which medical improvements have reduced 
the mortality considerably, exhibited the greatest economic differen- 
tials. On the other hand, mortality from premature birth and congenital 
malformations, «mong the most frequent biological causes of neonatal 
mortality, showed no relationship. 

These analyses for the city of Buffalo in 1940 have further substanti- 
ated hypotheses previously tested in other cities. This study has fur- 
ther illustrated the usefulness of rentals as a device for socio-economic 
status differentiation and has demonstrated once more that demo- 
graphic phenomena are highly sensitive to socio-economic differenti- 
ation. 
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INTRODUCTION 


HANGEs in interest rates in recent years have sometimes given rise 

to heated controversies about their economic effects and the public 
policies involved. The present study was not undertaken because of 
any special concern with such controversies but as part of a larger 
analysis of income taxation in the United States. Nevertheless, some of 
the facts revealed may be useful to those who debate policy. 

For example, it is relevant to know something about the distribution 
of interest income in discussing debt management or tax policy. A once 
popular belief was that the bulk of interest goes to a class of wealthy 
investors who are enabled to live by clipping bond interest coupons and 
receiving interest on mortgages rather than by more direct contribu- 
tions to production. How far this assumption is from the present situa- 
tion in the United States is indicated by some of the figures contained 
in this report. It will be found, for example, that the importance of 
interest as a source of personal income has shrunk strikingly in recent 
years and that much the greater part of the total interest income re- 





* This paper arose out of a study, still in process, of the personal income tax. I am indebted to vari- 
ous members of the staff of the National Bureau of Economic Research and collaborators for criticism 
and suggestions, notably to W. L. Crum, Solomon Fabricant, Raymond W. Goldsmith, Daniel M. 
Holland, C. Harry Kahn, John W. Kendrick, Geoffrey H. Moore, George Soule, and, for valuable 
statistical assistance, to Arnold Oliphant and Bella Shapiro. I am also indebted to Charles F. Schwartz 
and Selma F. Goldsmith of the Departraent of Commerce, and Marius Farioletti of the Bureau of In- 
ternal Revenue, for supplying special tabulations or making useful suggestions. The charts were drawn 
by H. Irving Forman. 

This paper has been approved for publication, as a report of the National Bureau of Economic Re- 
search, by the Director of Research and the Board of Directors of the National Bureau, in accordance 
with the resolution of the Board governing Nativnal Bureau reports (see the 35th Annual Report of the 
National Bureau of Economic Research, May *955). It is to be reprinted as No. 51 in the National 
Bureau’s series of Occasional Papers, and the Ocvasional Paper may be ordered from the National 
Bureau of Economic Research, 261 Madison Avenric, New York 16, N. Y. 
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ceived by individuals is received by those with small and moderate 
incomes, 

Another subject of debate is the influence of tax policy on the choice 
of investments, that is, on the inducement to invest in relatively safe 
interest-bearing securities as against possibly risky equities, or vice 
versa. This study offers some revealing evidence in this connection, 
indicating, among other things, the great importance of the prevailing 
expectations with respect to the possibilities of capital gains. 

Is the tendency to higher interest rates which has been in evidence 
since 1946, despite some interruptions and reversals, likely to continue, 
and, if so, are interest rates likely to rise to former peaks? On these 
questions, too, some light is thrown by analysis of the statistics and 
related considerations. 

It will doubtless surprise many persons to learn that only about one- 
fourth of the interest income estimated to have been received by indi- 
viduals in the years 1930-1950 was reported on taxable income tax re- 
turns, despite the sharp decreases during the 1940’s in the level of 
personal exemptions and the huge increase in the number of taxable 
returns filed. What are the important reasons for this fact? 

It is the aim of the National Bureau of Economic Research, in this 
as in its other studies, not to take a position on disputed issues of public 
policy but rather to make available empirical evidence and impartial 
analysis for whatever uses may be appropriate. 

The present paper begins by tracing the changing absolute and rela- 
tive importance of personal interest receipts and their changing distri- 
bution as between higher and lower income groups. Then it examines 
the principal causes of the long decline in personal interest income after 
1929 and attempts to account, so far as possible, for the fact that only 
about one-fourth of total personal interest receipts have been reported 
on taxable individual income tax returns in recent years. The third 
section of the paper presents some measures of the importance of inter- 
est income as a source of tax revenues. The fourth discusses the effects 
of the progressive income tax rate structure upon interest yields and 
interest income after taxes and the resulting influence upon incentives 
to individuals to invest in taxable fixed-interest securities. A final sec- 
tion considers some probable future trends in personal interest income. 


I, THE SHIFTING IMPORTANCE OF INTEREST 
AS A SOURCE OF PERSONAL INCOME 


1. Substantial fluctuations in the absolute and relative importance of in- 
terest have occurred since 1913 


Although interest has the reputation of being a relatively stable 
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TABLE 1 
INTEREST COMPONENT OF PERSONAL INCOME RECEIPTs, 
1913-1953 
, Total Interest : Total Interest 
income income 
receipts Mone- ——. receipts Mone- °° Pe 
Year of indi- tary wee Year of indi- tary contage 
viduals interest ° tote viduals interest of total 
(billions teeta (billions re ‘ 
of dollars) sooneys of dollars) oe c) 
° 
1913 32.5 1.4 4.4 1932 45.6 4.6 10.2 ‘ 
1914 31.9 1.5 4.6 1933 44.2 4.4 10.0 ; 
1915 34.1 1.6 4.6 1934 50.3 4.2 8.4 ‘ 
1916 40.7 1.6 4.0 1935 56.9 4.0 7.1 = 
1917 49.5 1.8 3.6 1936 = 65. 1 3.9 6.0 « 
1918 55.2 2.0 3.6 1937 70.2 3.9 5.5 
1919 63.1 2.6 4.1 1938 64.6 3.8 5.9 
1939 69.1 3.8 5.4 
1919 63.7 2.9 4.6 1940 74.8 3.7 5.0 
1920 66.9 3.3 5.0 1941 92.6 3.7 4.0 
1921 53.3 3.5 6.6 1942 118.9 3.5 2.9 
1922 57.3 3.6 6.3 1943 145.3 3.4 2.3 
1923 66.5 3.8 5.7 1944 137.4 3.5 2.2 
1924 66.9 3.9 5.8 1945 162.0 3.9 2.4 
1925 70.8 4.1 5.7 1946 172.5 4.4 2.5 
1926 73.7 4.1 5.6 1947 185.2 4.9 2.6 
1927 74.1 4.3 5.8 1948 201.6 5.3 2.6 
1928 75.9 4.6 6.0 1949 197.8 5.7 2.9 
1929 80.2 4.8 6.0 1950 219.2 6.2 2.8 
1951 245.8 6.8 2.8 
1929 80.1 5.2 6.5 1952 260.1 7.0 2.7 
1930 71.0 4.8 6.8 1953 274.3 7.5 2.8 
1931 60.3 4.8 8.0 





Source: Simon Kuznets, Shares of Upper Income Groups in Income and Savings (National Bureau of 
Economic Research, 1953), pp. 570-571, 576-577. Kuznets uses three separate series: (a) 1913-1919, 
estimates of W. I. King, The National Income and Its Purchasing Power (NBER, 1930), and an unpub- 
lished extension of the 1913 data, modified by Kuznets (Simon Kuznets, National Product in Wartime 
[NBER, 1945]); (b) 1919-1929, Simon Kuznets, National Income and Its Composition, 1919-1938 
(NBER, 1941); (c) 1929-1953, National Income Supplement, 1954, Survey of Current Business; and 
Statistical Abstract of the United States, 1958. 

Excluded from the figures used here for total personal income are imputed rent, imputed interest, 
inventory valuation adjustments, business transfers except cash prises, and government contributions 
to military family allowances. 

Interest receipts of nonprofit institutions furnishing services to individuals and current accruals of 
interest (discount) on individuals’ holdings of United States savings bonds are included, while interest 
received by individuals from other individuals are excluded, from the personal interest income figures of 
the Department of Commerce which are used here. 
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CHART 1 


INTEREST COMPONENT OF PERSONAL INCOME 
RECEIPTS, 1913-1953 
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source of income, the annual amounts received by individuals in the 
United States have varied substantially in both directions since 1913. 
This may be seen in Table 1 and Chart 1, which present annual esti- 
mates of total explicit or monetary interest receipts of individuals in 
the United States in the period 1913-1953. The figures were obtained 
by joining estimates made by Simon Kuznets for 1913-1929 with those 
of the Department of Commerce for years beginning with 1929.! 

Annual receipts of interest by individuals more than tripled between 
1913 and 1929. Then the total fell during the next fourteen years, reach- 
ing a level in 1943 about 35 per cent below that of 1929. In contrast, 
dividends paid to individuals in 1943 were only 23 per cent lower than 
in 1929, while employee compensation, entrepreneurial income, and 
rents were all much higher.” Personal interest receipts began a vigorous 
upturn in 1944, but despite a great growth in public and private debt 
they did not overtake the 1929 total until 1948. A comparison of the 
year-to-year movements of personal receipts of interest and dividends 
in 1913-1953 is shown in Table 2 and Chart 2. 

Most striking in the record of personal interest receipts is the long 
and pronounced decline in their relative importance as a component of 
total personal incomes since 1933 (Table 1 and lower section of Chart 
1). Interest had contributed a rising proportion of total personal in- 
come receipts between 1913 and 1929. In 1929, the first year of the 
present series of national income estimates of the Department of Com- 
merce, the money interest receipts of individuals constituted 6.5 per 
cent of their total income receipts. During the early years of the Great 
Depression the more drastic shrinkage of other sources of income caused 
the share of interest in total personal income to rise above 10 per cent 
in 1932 and 1933, though the absolute amount fell substantially. After 
1933 the percentage importance of interest declined sharply and almost 
uninterruptedly until it reached a low of slightly more than 2 per cent 
in 1944. Since then the proportion has fluctuated somewhat between 
23 and 3 per cent. 

The record low levels in the relative importance of interest in total 
personal income receipts in recent years resulted in largest part from 
the vastly disproportionate increases in other sources of income. 
Whereas employee compensation rose from $46 billion to $200 billion 
between 1939 and 1953 and entrepreneurial income, from $11 billion to 
$38 billion, money interest receipts of individuals rose only from $3.8 
billion to $7.5 billion.’ 

1 See note to Table 1. 

2 National Income Supplement, 1954, Survey of Current Business, Dept. of Commerce. 


3 Ibid., and special tabulation by Dept. of Commerce of net rent of owner-occupied farm dwellings, 
the amounts of which we deduct from entrepreneurial income. 
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9. Interest income more widely diffused among the different income yroups 


The figures for 1917-1948 presented by Simon Kuznets in his Shares 
of Upper Income Groups in Income and Savings and those compiled 
from income tax returns for subsequent years show that a marked re- 
duction has taken place in the concentration of interest receipts in the 
upper income groups. Correspondingly, there has been an increased 
diffusion of interest income among the population as a whole. Whereas 
the top 5 per cent of the population, when ranked according to the size 
of family income per member, received 51 per cent of the total personal 
interest income in 1919 and an average of 43.4 per cent of the total in 
1918-1938, their proportion had fallen to 26 per cent in 1948.4 The 
share of the top 1 per cent of the population fell from 37 per cent of all 
personal interest income in 1919 to 15} per cent in 1948. Conversely, 
the lower 95 per cent of the population, who had obtained only 49 per 
cent of total personal interest receipts in 1919, received 74 per cent in 
1948. 

Although the relative importance of interest as a component of per- 
sonal income declined substantially for all income groups, the decline 
was greatest in the upper income groups. In 1919 the top 1 per cent of 
the population obtained 13.4 per cent of their aggregate income from 
interest; ten years later the proportion was still 12.9 per cent; but by 
1948 it had shrunk to 4.6 per cent. The top 5 per ceni of the population, 
whose interest receipts had accounted for 10.3 per cent of their total 
income in 1919 and 10.9 per cent in 1929, obtained only 3.7 per cent of 
their total income from this source in 1948. For the lower 95 per cent 
of the population the percentage of income obtained from interest was 
2.9 per cent in 1919 and 2.2 per cent in 1948. 

These figures, as well as those from income tax returns in recent 
years, are in contrast with a still common notion that members of the 
uppermost income groups in the United States obtain a large fraction 
of their incomes by clipping and cashing maturing interest coupons on 
their bond holdings and from interest receipts on mortgages and other 
loans. The income tax figures indicate that in the ten years ending in 
1950, taxable interest contributed an average of only about 3 cents of 
each dollar of adjusted gross income reported by all taxpayers with in- 
comes exceeding $25,000. (Adjusted gross income is statutory net in- 
come before the allowable personal deductions such as interest and 
taxes paid, charitable contributions, medical expenses, and casualty 
losses.) The major sources of taxable individual incomes over $25,000 





‘Simon Kuznets, Shares of Upper Income Groups in Income and Savings (National Bureau of 
Economic Research, 1953), Table 123, pp. 647 ff. 
§ Ibid., Table 125, pp. 668 ff. 
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in 1941-1950 were dividends, salaries, and entrepreneurial income from 
unincorporated business. In 1950 the proportion that taxable interest 
contributed to the total adjusted gross income of each of nine income 
size groups was as follows: 








Income size group Per cent 





$500 ,000 and over 2.9 
$100 ,000-$500 ,000 3.0 
50,000-— 100,000 2.9 
25,000— 50,000 2.8 
10,000— 25,000 2.2 
5,000-— 10,000 0.9 
3,000- 5,000 0.4 
2,000- 3,000 0.5 
0.8 


Less than $ 2,000 





TABLE 2 


COMPARISON OF INTEREST AND: DIVIDENDS IN 
PERSONAL INCOME, 1913-1953 


(Billions of Dollars) 











Year Dividends Manetery Year Dividends 7 aneteny 
interest interest 

1913 2.1 1a 1932 2.6 4.6 
1914 2.0 1.5 1933 2.1 4.4 
1915 2.0 1.6 1934 2.6 4.2 
1916 3.3 1.6 1935 2.9 4.0 
1917 3.7 1.8 1936 4.5 3.9 
1918 3.5 2.0 1937 4.7 3.9 
1919 3.2 2.6 1938 3.2 3.8 

1939 3.8 3.8 
1919 2.9 2.9 1940 4.0 3.7 
1920 3.2 3.3 1941 4.5 3.¢ 
1921 3.0 3.5 1942 4.3 3.5 
1922 3.0 3.6 1943 4.5 3.4 
1923 3.8 3.8 1944 4.7 3.5 
1924 3.8 3.9 1945 4.7 3.9 
1925 4.4 4.1 1946 5.8 4.4 
1926 4.7 4.1 1947 6.5 4.9 
1927 5.0 4.3 1948 7.2 5.3 
1928 5.5 4.6 1949 7.5 5.7 
1929 6.3 4.8 1950 9.2 6.2 

1951 9.1 6.8 
1929 5.8 5.2 1952 9.1 7.0 
1930 5.5 4.8 1953 9.4 Ta 
1931 4.1 4.8 














Source: Same as in Table 1. 
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CHART 2 


COMPARISON OF INTEREST AND DIVIDENDS IN 
PERSONAL INCOME, 1913-1953 


Billions of dollars 
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Of the 4,410,271 tax returns reporting interest income in 1950, 3,667,607 
of them reported interest income of $500 or less, and only 27,161 re- 
ported interest income of more than $5,000 (Table 3). The latter group 
accounted for 15.8 per cent of the total interest income reported on all 
returns. 

Unlike Kuznets’ estimates the figures drawn from income tax returns 
are subject to the qualification that they do not include interest from 
tax-exempt securities. Such securities are doubtless held in relatively 
greater amounts among the upper income groups than among the lower. 
Nevertheless, as recently as 1941—the latest year for which this in- 
formation was reported on income tax returns—individuals with in- 
comes of $5,000 and over reported only 39 per cent of the state and 
local government securities estimated to be in the hands of indi- 
viduals.* After allowing for the holdings of taxable fiduciaries and for 
underreporting, George Lent’s estimates indicate that individuals with 





6 George E. Lent, The Ownership of Tax-Exempt Securities, 1913-1953 (National Bureau of Eco- 
nomic R ch. Occasional Paper 47 (1954), Table 7. Between 1924 and 1942 the law required indi- 
viduals with incomes of $5,000 or more to report on their income tax returns the amount of their holdings 
of tax-exempt securities and the amount of interest obtained therefrom. 

















1256 AMERICAN STATISTICAL ASSOCIATION JOURNAL, DECEMBER 1955 


TABLE 3 


ESTIMATED INTEREST ON ALL RETURNS REPORTING 
INTEREST, BY SIZE OF INTEREST INCOME 
REPORTED, 1950 














Total — 
, . interest ——— 
Bise of No. of Average income of of total 
interest returns interest each Paar 
income be aah reported group in each 
reported grou per (1) X(2) group 
(thousands of P return (thousands of (3) + total 
dollars) dollars) (per cent) 
(1) (2) (3) (4) 
Under 0.1 2,041,369 50 102 ,068 6.17 
0.1-0.2 782 , 152 150 117 ,323 7.09 
0.2-0.3 406 ,642 250 101 ,660 6.14 
0.3-0.4 267 ,209 350 93 ,523 5.65 
0.4-0.5 170,235 450 76 ,606 4.63 
0.5-1.0 409 ,048 750 306 , 786 18.53 
1.0-1.5 138,969 1,250 i73,711 10.49 
1.5-2.0 65,106 1,750 113,936 6.88 
2.0-2.5 37,753 2,250 84,944 5.13 
2.5-3.0 22,858 2,750 62,860 3.80 
3-4 26 ,829 3,500 93 ,902 5.67 
4-5 14,940 4,500 67 ,230 4.06 
5-10 20,114 6,385.6 128 ,440 7.76 
10-25 6 ,054 14,564.5 88,173 5.33 
25-30 809 33 , 735.7 27 , 292 1.65 
50-100 148 66 ,878.0 9,898 0.60 
101 or over 36 198,775.9 7,156 0.43 
Total 4,410,271 1,655,508 100.01 
Actual total reported in 
Statistics of Income 1,595,604 
Overestimate 59 ,904 
Per cent overestimated 3.75 





Source: For returns reporting interest income of less than $5,000, the average amount of interest 
per return in each interest size group was estimated by assuming that it equalled the midpoint of the 
size group. The number of returns in each group was obtained from Statistics of Income. Because of the 
wider limits of the interest size groups of $5,000 and over, the midpoint was not deemed representative 
of the average. In these groups the average amount of interest per return was estimated by assuming 
that the number of returns within each interest size group, as reported in Statistics of Income, was dis- 
tributed similarly to the distribution of returns reporting adjusted gross income of the same amounts. 


incomes of less than $5,000 owned nearly one-half of the state and local 
government securities in individual hands in 1941.7 We estimate below, 





? Unpublished estimate of George E. Lent. 
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moreover, that individuals’ interest receipts from tax-exempt securities 
fell by more than one-half between 1932 and 1949. 

In the taxable estate tax returns filed in 1949, as reported in Sta- 
tistics of Income for 1948, wholly tax-exempt securities constituted 0.5 
per cent of gross estates under $100,000, 1 per cent of those between 
$100,000 and $500,000, 3.2 per cent of those between $500,000 and 
$1 million, 8.4 per cent of $1 million to $5 million, 17 per cent of $5 mil- 
lion to $10 million, and 32.2 per cent of gross estates of $10 million and 
over. For estates under $5 million the proportions represented by tax- 
exempt securities in 1949 were considerably smaller, on the average, 
than the proportions in the years 1934-1945 and for estates of $5 mil- 
lion and over, slightly greater. 

Although the exclusion of tax-exempt interest causes the role of 
interest in the composition of income to be somewhat understated for 
all income groups in figures taken from income tax returns, and to be 
more understated for larger than for smaller incomes, the understate- 
ment is relatively small for any broad income size group. Tax-exempt 
interest doubtless bulks large in the incomes of some individuals. But 
entrepreneurial profits, dividends, capital gains, and salaries dwarf tax- 
exempt interest in importance for each group as a whole. This is true 
even of the highest income groups, which owe the bulk of their aggre- 
gate incomes to dividends, entrepreneurial profits, and capital gains.*® 
Even if, in the face of the sharp decline in total tax-exempt interest re- 
ceived by individuals and of the evidence from estate tax returns, we 
were to assume that the amount of fully tax-exempt interest received 
in 1949 by taxpayers with adjusted gross income of $100,000 or more 
was twice as great as the amount of such interest reported by these 
income groups in 1940, their tax-exempt interest would equal only 3.9 
per cent of the adjusted gross income they reported. Thomas R. Atkin- 
son found that in 1949 state and local government obligations repre- 
sented only 2.3 per cent of the total financial assets of all Wisconsin 
individuals with incomes of $50,000 or more and only 1.2 per cent of 
the financial assets of those with incomes of $20,000—50,000.° 

The lower among taxable income groups have received a surprisingly 
large porportion of the aggregate interest income reported on taxable 
individual returns in recent years. Individuals with incomes under 
$5,000 accounted for an average of 41 per cent of the annual totals of 
taxable interest in the ten years 1941-1950, and even those with in- 
comes under $3,000 accounted for an average of 25 per cent of the 
annual totals in this period (Tables 4 and 5). 

8 Kuznets, op. cit., Table 125, pp. 668 ff. 


® Thomas R. Atkinson, “The Pattern of Financial Asset Ownership” (National Bureau of Economic 
Research, mimeographed). 
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The lower among the taxable income groups have consistently re- 
ceived a much larger proportion of total interest income than of total 
dividend receipts reported on taxable returns. Taxpayers with incomes 
under $5,000 accounted for an average of only 13 per cent of the annual 
total of dividends reported on taxable returns in the 33 years 1918- 
1950 as compared with 36 per cent of the total interest. In 1941-1950 
they accounted for an average of 21 per cent of the total dividends so 
reported as against 41 per cent of total interest. 

These figures accord in a general way with Atkinson’s findings with 
respect to the pattern of financial asset ownership of individuals in 
Wisconsin in 1949.!° Atkinson found that individuals with incomes less 
than $5,000 tend to concentrate much of their financial assets in debt 
obligations, particularly such kinds as savings bank deposits, building 
and loan association shares, and life insurance policies but that corpo- 
rate stocks become progressively more important in higher income 
groups. Similarly, Joseph A. Pechman, in analyzing the patterns of 
income revealed by a study of the 1936 Wisconsin state income tax 
returns, found that interest was the second most frequent source of in- 
come for wage earners with less than $2,000 income and for entrepre- 
neurs with incomes under $1,000, as well as for entrepreneurs with in- 
comes between $6,000 and $8,000." 


3. Long decline in personal interest income after 1929 and relatively 
moderate growth in recent years in the face of large increases in debt 


It is easy to point to the major reasons for the substantial decline in 
the absolute amounts of interest received by individuals during the 
1930’s but less easy to account fully for the only moderate recovery in 
the 1940’s, when immense increases took place in the amount of inter- 
est-bearing debt outstanding in the United States. 

In the years immediately following 1929 the shrinkage in personal 
interest income reflected in part a general contraction in the total 
amount of interest payments, both to individuals and corporations. 
Such payments fell sharply in the early years of the Great Depression 
as bank and other loans were reduced and as defaults took place on 
large amounts of bonds of railroad and other domestic corporations, 
foreign governments and corporations, and on residential and farm 
mortgages. Gross annual monetary interest payments in the United 
States fell from $13.6 billion in 1929 to $8.8 billion in 1935, as estimated 





10 Tbid. 
U Frank A. Hanna, Joseph A. Pechman, and Sidney M. Lerner, Analysis of Wisconsin Income, 
Studies in Income and Wealth, Volume Nine (National Bureau of Economic Research, 1948), Chap. 2. 





TABLE 4 


PERCENTAGE SHARES OF DIFFERENT INCOME GROUPS 
IN TOTAL INTEREST REPORTED ON TAXABLE 
INDIVIDUAL AND FIDUCIARY RETURNS, 
1918-1950 








Net income classes* (thousands of dollars) 
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* In 1944 changed to Adjusted Gross Income Classes. 

Note: The figures do not necessarily add to 100 per cent due to rounding. 

Source: Statistics of Income. For the years 1918 through 1930, the figures for interest include 
annuities and dividends from foreign corporations (except those with more than 50 per cent of gross 
income from sources in the United States), both of which were lumped with interest as “Interest and 
other income” in Statistics of Income. 

For the years 1927-1931 and 1932-1936, respectively, when Statistics of Income combined 
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TABLE 5 


INTEREST REPORTED ON TAXABLE RETURNS, BY INCOME 
CLASSES, 1918-1950 


(Millions of Dollars) 








Net income classes* (thousands of dollars) 








Year 
Less , 50- 100- Over 
than 2 2-3 3-5 5-10 10-25 25-50 100 500 500 Total 

1918 99.9 165.5 241.0 220.6 199.5 120.5 92.9 113.8 51.8 1,305.6 
1919 96.7 114.9 244.4 255.5 254.6 152.3 108.6 127.2 41.1 1,395.3 
1920 111.2 186.3 276.2 268.7 274.8 156.0 105.2 81.4 22.3 1,482.0 
1921 110.5 115.8 2683.0 246.4 249.3 134.4 79.5 53.8 9.7 1,262.2 
1922 104.7 115.4 281.2 278.8 289.0 165.3 107.7 78.9 21.4 1,441.8 
1923 181.8 204.0 514.1 285.3 311.5 172.7 103.5 76.3 20.2 1,869.3 
1924 181.6 216.1 591.4 260.4 326.3 190.6 122.7 94.7 25.2 2,009.9 
1925 67.3 103.3 297.9 287.7 372.4 219.7 140.3 114.7 37.8 1,641.0 
1926 70.5 111.2 267.9 336.1 416.9 233.1 152.1 128.5 40.8 1,757.1 
1927 59.7 104.6 240.2 350.0 451.4 252.2 169.4 147.4 49.7 1,833.6 
1928 69.7 130.2 272.8 377.7 459.5 265.3 188.5 183.6 85.2 2,032.5 
192¢ 58.0 114.1 249.7 380.3 473.4 270.5 188.1 187.4 97.0 2,018.5 
19306 79.5 126.3 245.7 354.6 423.1 213.2 130.4 100.3 35.4 1,708.4 
931 46.2 84.1 162.8 269.1 310.4 130.0 75.7 47.1 15.6 1,141.0 
1932 96.7 93.9 176.0 224.6 207.0 116.2 59.2 32.1 6.4 1,012.1 
1933 94.7 83.9 145.9 184.7 164.6 900.8 47.3 29.5 6.2 847.5 
1934 61.8 55.1 127.3 197.5 204.5 93.9 44.4 23,5 6.0 813.9 
1935 56.5 53.1 121.6 193.0 204.1 99.7 51.5 28.2 5.3 813.0 
1936 88.5 77.8 159.8 172.0 186.8 101.8 62.4 42.3 11.3 902.7 
1937 88.4 71.8 153.1 177.1 187.9 96.1 56.7 39.5 11.6 882.3 
1938 135.3 77.7 158.7 177.2 172.4 74.8 35.2 25.7 9.1 866.1 
1939 136.5 78.9 157.8 178.3 179.2 84.9 42.1 31.5 10.9 900.3 
1940 183.9 132.6 170.8 169.0 169.2 80.8 40.4 30.5 10.3 987.5 
1941 269.7 187.0 207.5 161.8 161.6 76.9 40.9 33.8 9.0 1,148.2 
1942 293.5 171.9 173.5 157.1 172.7 87.9 46.3 33.3 6.2 1,142.3 
1943 259.2 136.5 158.2 168.2 179.2 94.5 53.4 39.3 7.0 1,095.4 
1944 169.1 118.2 172.5 190.4 201.4 104.2 63.3 45.9 14.7 1,079.7 
1945 148.8 116.1 155.8 193.3 214.2 111.4 66.7 47.6 14.8 1,068.6 
1946 133.7 122.0 182.4 222.9 259.5 139.4 83.7 64.3 19.7 1,227.6 
1947 136.2 126.1 185.5 234.2 260.2 138.7 80.2 65.5 19.9 1,246.5 
1948 70.0 98.4 204.0 273.2 284.4 164.0 101.1 84.5 21.5 1,301.2 
1949 99.8 126.7 260.6 335.1 311.7 173.6 104.0 81.8 21.5 1,514.8 
1950 86.1 111.6 243.1 348.7 356.0 216.3 127.1 102.1 27.5 1,618.5 





* In 1944 changed to Adjusted Gross Income Classes, 


Note: The figures do not necessarily add to total due to rounding. 
Source: Same as in Table 4, 





nontaxable returns with taxable returns for net incomes of $5,000-10,000 and $5,000-6,000, interest on 
taxable returns was estimated by assuming that it constituted the same ratio of total reported interest as 


the ratio of net income on taxable returns to total net income reported. 


For the years 1941-1943, interest was combined with dividends, rents, royalties (for 1941 only), and 
annuities on 1040A (short-form) returns. Interest was estimated for these years by assuming it consti- 
tuted the same ratio of these components of income as in 1040 (long-form) returns. 
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by the Department of Commerce. Total personal interest receipts fell 
from $5.2 billion in 1929 to $4.0 billion in 1935. The amount of personal 
interest income continued to shrink until the entrance of the United 
States into World War II and did not rise above its 1929 level until 
1948 despite a large and rapid growth in the total of public and private 
debt during the war and postwar years. 

The net amounts of public and private debt outstanding, as defined 
by the Department of Commerce but modified to exclude debt bearing 
no interest, are shown for selected years, by classes of borrowers, in 
Table 6. Net public and private debt is defined to exclude government 
obligations held by federal, state, or local governments or their agencies 
or trust funds; corporate obligations held by the issuer or its close 
affiliate; and short-term debts of individuals and nonfinancial unin- 
corporated business enterprises owed to other such enterprises and 
individuals. It also excludes bank deposits and the value of outstanding 
life insurance and annuity policies. The total fell from $160 billion in 
1930 to $142 billion in 1933. It rose slowly during the remainder of the 
1930’s, approximating its 1930 level in 1940. Then it more than doubled 
in the next five years, reaching $362 billion in 1945. The whole of this 
$202 billion increase in this period was in the federal debt and was 
mainly occasioned by World War II. The federal government then re- 
duced its debt by $24 billion between 1945 and 1953, but the net debt 
of other borrowers more than doubled, causing the total net public and 
private interest-bearing debt at the end of 1953 to be greater than that 
at the end of 1945 by $141 billion, or 39 per cent, and greater than that 
at the end of 1929 by $345 billion, or 218 per cent. 


For 1943, interest includes partially tax-exempt dividends. 

For 1944-1945, when interest and dividends were reported together on income tax returns, interest 
was estimated by assuming that the ratios between interest and dividends in 1944-1945 were the same 
asin 1946. 

Prior to 1937, interest received by fiduciaries was not tabulated separately from their other income. 
We estimated interest to constitute the following percentages of the adjusted gross income of fiduciaries 
in this period: 1922-1930, 0.33; 1931-1935, 0.40; 1936, same as 1937. For years beginning with 1937, the 
amount of interest included in individuals’ taxable income from fiduciaries was estimated to constitute 
the same proportion of their total income from fiduciaries as interest was of the adjusted gross income of 
fiduciaries. 

Two elements in these procedures work to produce small opposing errors: (1) Some understatement 
of interest occurs because long-term capital gains are included in the adjusted gross income of fiduciaries, 
thereby making the ratio of interest to their adjusted gross income lower than it otherwise would be. 
Since long-term capital gains are reported separately by the beneficiaries and are not included in income 
from fiduciaries, our estimate of interest from fiduciaries is somewhat less than it should be. (2) Some 
overstatement of interest arises, on the other hand, because a portion of interest directly reported on 
taxable fiduciary returns is distributed to beneficiaries as income from fiduciaries and is thus included 
twice in our estimates of interest. The duplication appears to be very small, however. In 1947, for exam- 
ple, if all the interest income of fiduciaries went to beneficiaries filing taxable returns—an extreme as- 
sumption, the maximum overestimation of interest from fiduciaries would be about 3 per cent. 

Excluded from our figures for 1944-1947 and 1948-1950 is the interest portion of the total of 
dividends and interest not exceeding $100 per return, sapentes Uy tingagens Ging WE setae Se Be 
former period and 1040A in the latter. 
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TABLE 6 


NET INTEREST-BEARING PUBLIC AND PRIVATE DEBT 
AT END OF CALENDAR YEARS, 1918-1953 


(Billions of Dollars) 








State a 





a Total 

Federal and Long- Farm Nonfarm Ghort- mortgage, _interest- 

govern- local term term y 

mortgage mortgage non- bearing 
ment govern- corporate corporate 
corporate debt 
ment 

1918 21 5 30 4 10 7 27 106 
1919 25 5 31 8 10 7 24 112 
1920 23 6 33 10 12 8 25 117 
1921 23 7 34 ll 13 8 25 120 
1922 23 8 34 11 14 9 25 123 
1923 21 8 36 ll 16 9 25 127 
1924 21 9 39 10 19 9 26 132 
1925 20 10 40 10 21 10 27 137 
1926 19 ll 42 10 24 10 27 142 
1927 18 ll 44 10 27 10 27 148 
1928 17 12 46 10 30 10 29 154 
1929 16 13 47 10 31 11 29 158 
1930 16 14 51 9 32 9 28 160 
1931 18 16 50 9 31 8 23 154 
1932 21 16 49 8 29 6 18 148 
1933 24 16 48 8 26 5 15 142 
1934 30 16 45 8 26 5 15 143 
1935 33 16 44 7 25 5 16 145 
1936 37 16 43 7 24 5 17 149 
1937 39 16 44 7 24 7 18 154 
1938 40 16 45 7 24 5 17 154 
1939 42 16 44 7 25 5 17 156 
1940 44 16 44 6 26 5 18 160 
1941 56 16 44 6 27 6 20 175 
1942 101 15 43 6 27 8 15 214 
1943 153 14 41 5 26 8 5 263 
1944 210 14 40 5 26 9 17 321 
1945 250 13 38 5 27 7 362 
1946 228 14 41 5 32 9 20 350 
1947 221 15 46 5 39 12 24 361 
1948 214 17 52 5 45 12 29 376 
1949 216 19 57 6 51 12 33 394 
1950 216 22 60 6 59 18 39 420 
1951 216 24 66 7 67 22 42 444 
1952 221 27 73 7 75 22 48 474 
1953 226 30 79 8 84 23 53 503 





Sources: Federal debt: Survey of Current Business, September 1953, October 1954; Annual Report 
of the Secretary of the Treasury, 1952. Matured and non-interest-bearing debt was deducted from the 
total net federal debt outstanding at the end of calendar years as reported in the Survey of Current 
Business for years beginning with 1929; for prior years the June 30 figures for matured and non-interest- 
bearing debt as reported in the Treasury's annual report were averaged to obtain December 31 esti- 
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Certain important sources of interest income of individuals are ex- 
cluded from the figures in Table 6. Among these are savings and time 
deposits in commercial and savings banks, the total amount of which 
increased from $22.1 billion to $63.6 billion between the end of 1924 
and of 1952. Except for mortgage debt the amount of interest-bearing 
obligations of individuals and unincorporated enterprises owed to non- 
corporate creditors is also excluded. 


4, Decline in interest rates 


Contributing most conspicuously to the contraction of total interest 
payments after 1932 was the prolonged decline in interest rates that 
then began. One type of interest rate was reduced to zero and a related 
one severely limited by the Banking Act of 1933 which prohibited the 
payment of interest on demand deposits by any bank insured by the 
Federal Deposit Insurance Corporation and provided for regulating 
the maximum rate of interest that could be offered by such banks on 
time and savings deposits. 

Interest rates on new issues of corporate and governmental debt 
securities and market yields on old issues fell almost continuously be- 
tween 1932 and 1946. During this period the monthly average yield on 
Moody’s Aaa corporate bonds fell from 5.01 per cent in 1932 to 2.53 per 
cent in 1946; that on Moody’s Baa bonds fell from 9.30 to 3.05 per 
cent; and the average yield of long-term Treasury bonds fell from 3.68 
for bonds issued when the interest was exempt from normal tax to 2.19 
per cent for fully taxable bonds (Table 7, Chart 3). (For corporate in- 
vestors the latter yield was reduced to about 1.66 per cent by the nor- 
mal tax of 24 per cent, and to still lower levels for corporations with 





mates, and these were deducted from the December 31 totals of net federal debt reported in the Survey 
of Current Business. 

State and local government debt: Survey of Current Business, September 1953, October 1954, 
May 1955; Indebtedness in the United States, Dept. of Commerce, 1942; Governmental Debt in 1962, 
Dept. of Commerce, 1952. Non-interest-bearing debt was deducted from the Commerce figures, the 
amounts for 1929-1941 and 1944-1952 being obtained from the Commerce volumes cited, and estimated 
for the other years. The June 30 interest-bearing debt figures were averaged to obtain December 31 
estimates. 

Long-Term Corporate, Farm Mortgage, and Non-Farm Mortgage, Survey of Current Business, 
September 1953, October 1954. 

Short-Term Corporate: R. W. Goldsmith’s unpublished NBER monograph on capital requirements, 
Statistics of Income. Figures for 1912, 1922, 1929, and 1933 represent the non-real estate loans of all 
operating commercial banks to corporations as reported by Goldsmith. Figures for the intervening 
years were obtained by straight-line interpolation. Those for 1934-1936 are interpolations of Gold- 
smith’s figures for 1933 and 1939. Figures for 1937-1952 were estimated from the balance sheet data in 
Statistics of Income, adjusted to cover corporations not submitting balance sheets. 

Nonmortgage-Noncorporate: Survey of Current Business, September 1953, October 1954; Rolf 
Nugent, Consumer Credit and Economic Stability (Russell Sage Foundation, 1939), Federal Reserve 
Bulletin, April 1953. From the total net nonmortgage-noncorporate debt figures reported by the Dept. 
of Commerce, we deducted our estimates of consumers’ non-interest-bearing charge account and service 
debt, estimates based on the data reported in Nugent, op. cit., and the Federal Reserve Bulletin. 








TABLE 7 ne 


YIELDS ON UNITED STATES TREASURY AND CORPORATE 
BONDS, 1919-1954 














Long-term Moody’s corporate bonds 
Year treasury Per 
bonds Aaa A Baa 10 
1919 4.73 5.49 6.48 7.25 9 
1920 5.32 6.12 7.41 8.20 
1921 5.09 5.97 7.28 8.35 8 
1922 4.30 5.10 6.03 7.08 
1923 4.36 5.12 6.17 7.24 7 
1924 4.06 5.00 5.93 6.83 
1925 3.86 4.88 5.55 6.27 P 
1926 3.68 4.73 5.24 5.87 
1927 3.34 4.57 5.04 5.48 
1928 3.33 4.55 5.01 5.48 
1929 3.60 4.73 5.28 5.90 
1930 3.29 4.55 5.13 5.90 
1931 3.34 4.58 6.01 7.62 
1932 3.68 5.01 7.20 9.30 
1933 3.31 4.49 6.09 7.76 
1934 3.12 4.00 5.08 6.32 
1935 2.79 3.60 4.55 5.75 
1936 2.65 3.24 4.02 4.77 
1937 2.68 3.26 4.01 5.03 
1938 2.56 3.19 4.22 5.80 
1939 2.36 3.01 3.89 4.96 
1940 2.21 2.84 3.57 4.75 
1941 1.95 Raat 3.30 4.33 
1942 2.46 2.83 3.28 4.28 
1943 2.47 2.73 3.13 3.91 
1944 2.48 2.72 3.06 3.61 
1945 2.37 2.62 2.87 3.29 
1946 2.19 2.53 2.75 3.05 
1947 2.25 2.61 2.87 3.24 
1948 2.44 2.82 3.12 3.47 
1949 2.31 2.66 3.00 3.42 
1950 2.32 2.62 2.89 3.24 
1951 2.57 2.86 3.13 3.41 
1952 2.68 2.96 3.23 3.52 
1953 2.93 3.20 3.47 3.73 
1954 2.54 2.90 3.06 3.18 





Source: Treasury Bulletin and Moody’s Investors Service as reprinted in Survey of Current Business. 
Between Jan. 1, 1919 and Oct. 15, 1925, the Long-Term Treasury Bond series included all outstanding 
partially tax-exempt bonds not due or callable for 8 years or more; between Oct. 15, 1925, and Dec. 31, 
1941, those not due or callable for 12 years or more; between the latter date and April 1, 1952, all fully 
taxable issues not due or callable for 15 years or more; since the last date, all fully taxable issues not 
due or callable for 12 years or more. 
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CHART 3 


YIELDS ON UNITED STATES TREASURY AND 
CORPORATE BONDS, 1919-1954 


Per cent 
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Source: Table 7. 


incomes subject to surtax.) Many millions of dollars of high-grade pub- 
lic utility and other corporate bonds, as well as much larger amounts of 
United States Treasury bonds, were refunded at, or in advance of, 
maturity during the latter half of the 1930’s and in the 1940’s by new 
issues carrying lower interest rates. 

Reflecting reduced interest rates, primarily, five of the seven major 
sources of private interest payments, exclusive of financial institutions— 
all but consumer credit and nonfarm mortgages—were smaller in 1948 
than in 1929, as may be seen in the figures for selected years in Table 8. 
Total interest payments from these seven sources, which had been $7.9 
billion in 1929, fell to $4.6 billion in 1940, after which they began an 
ascent which brought them to $5.6 billion in 1948 and to $10.4 billion 
in 1953. 


5. Institutionalization of investment 
One reason for the reduced level of total personal interest income in 
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TABLE 8 


MONETARY INTEREST PAYMENTS ON SELECTED 
TYPES OF DEBT, 1929-1953 


(Millions of Dollars) 








Debt of 





Pomc Debt of Net ; Total 
Mortgages portation other Same teiemed interest, Govern. 
on Farm non- Consumer selected ment 
and nm . from interest 
nonfarm mortgages pe banking credit types of interest 
utility brokers from A 
dwellings corpora- pronan private payments 
‘ tions debt 
tions 
1929 1,560 58: 1,387 2,358 443 951 577 7,858 1,506 
1930 1,549 570 1,468 2,312 444 264 608 7,215 1,513 
1931 1,571 553 1,567 2,023 366 100 550 6,730 1,521 
1932 1,538 526 1,551 1,768 259 45 426 6,113 1,574 
1933 1,425 472 1,490 1,524 196 32 324 5,463 1,689 
1934 1,336 430 1,314 1,691 221 32 242 5,266 1,849 
1935 1,275 396 1,293 1,588 276 15 207 5,050 1,831 
1936 1,232 364 1,244 1,500 386 33 195 4,954 1,868 
1937 1,214 341 1,183 1,498 490 38 160 4,924 2,019 
1938 1,192 320 1,176 1,382 458 22 138 4,688 1,920 
1939 1,167 205 1,156 1,336 500 25 127 4,616 1,941 
1940 1,172 293 1,077 1,345 616 20 120 4,643 2,059 
1941 1,217 284 1,035 1,320 743 17 126 4,742 2,088 
1942 1,238 272 1,053 1,207 530 15 130 4,445 2,407 
1943 1,184 246 1,035 1,105 326 20 115 4,031 3,141 
1944 1,147 230 978 1,097 303 26 118 3,899 3,895 
1945 1,131 220 951 1,095 328 32 130 3,887 4,934 
1946 1,204 216 838 1,122 496 25 135 4,036 5,772 
1947 1,372 222 839 1,340 784 18 168 4,743 5,751 
1948 1,592 229 873 1,548 1,085 23 224 5,574 5,904 
1949 1,783 242 959 1,735 1,310 26 230 6,285 6,196 
1950 2,061 264 $96 1,857 1,675 48 248 7,149 6,428 
1951 2,361 291 1,063 2,228 1,857 65 312 8,177 6,652 
1952 2,649 319 1,149 2,464 2,110 72 317 9,080 7,023 
1953 2,974 349 1,220 2,732 2,644 99 333 10,351 7,441 





Source: Office of Business Economics, Dept. of Commerce and National Income Supplement, 1954, 
Survey of Current Business. 


spite of the immense increase in the total outstanding public and 
private debt (Table 6) has been the continuing growth of financial 
intermediaries as holders of interest-yielding securities. Raymond W. 
Goldsmith has estimated that the main groups of financial intermedi- 
aries, excluding personal trust departments of banks but including 
government lending institutions and social security systems, increased 
their total assets by $257 billion, or more than 206 per cent, between 
1929 and 1949 (Table 9). They raised their share of the total owner- 


12 Raymond W. Goldsmith, The Share of Fi ial Intermediaries in National Wealth and National 
Assets, 1900-1949 (National Bureau of Economic Research, Occasional Paper 42, 1954), Table 1. 
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ship of each of the principal classes of interest-bearing securities out- 
standing in the United States as follows:” 








Per Cent of Total Held by 
Financial Intermediaries 








1929 1949 
United States government securities 46 64 
State and local government securities 34 53 
Corporation and foreign bonds 33 76 
Mortgages 61 70 





In consequence of this development much income that would for- 
merly have been received directly as interest income of individuals now 
takes the form of additions to various types of claims against financial 
intermediaries and of dividends from corporations. The channeling of 
savings and investment through private life insurance companies, for 
example, caused their assets—most of which consist of bonds and mort- 
gages—to increase from less than $9 billion in 1922 to $79 billion in 
1953.4 Interest-yielding investments that many individuals formerly 
made directly have in more recent years been made indirectly through 
life insurance companies. The effective exemption from personal in- 
come tax of the interest that policyholders earn on their accumulating 
life insurance reserves doubtless influences some persons to make some 
of their investments in this form rather than in the form of direct own- 
ership of fixed-interest securities. 

A development that considerably narrowed an important area of 
individual lending was the widespread adoption, under the leadership 
of the Federal Housing Administration, of the long-term amortized 
large value-ratio mortgage in the financing of residential construction 
and the consequent drastic reduction in the demand for junior lien 
financing in this field. During the 1920’s the junior mortgage or sub- 
ordinated land contract was a large source of interest income for many 
individuals. Because institutional investors then generally limited their 
first mortgage loans to 60 per cent or less of the appraised value of the 
property (the appraisals were sometimes excessively liberal, however), 
purchasers of houses commonly raised an additional 20 to 25 per cent 





3 Ibid., adapted from Tables 11, 13, 14, 16. “Outstanding in the United States” means only the 
amounts held in this country of the bonds of foreign corporations but the total outstanding issues, 
whether held in this country or abroad, of Amarican governments, corporations, and mortgages. 

4 Life Insurance Fact Book, Institute of Life Insurance. 
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TABLE 9 


TOTAL ASSETS OF MAIN GROUPS OF FINANCIAL 
INTERMEDIARIES 


(Billions of Dollars) 











Banking system: 
Federal Reserve Banks 
Commercial banks 
Mutual savings banks 
Postal Savings System 


Total 


Insurance: 
Private life insurance companies 
Frateraal life insurance organizations 
Private self-administered pension funds 
Federal, — retirement, and 
State and local social security funds 
Fire and marine insurance companies 
Casualty and miscellaneous insurance com- 

panies 

Mutual accident associations 


Total 119.53 


Miscellaneous financial intermediaries : 
Saving and loan associations 
Credit unions 
Investment companies 
Investment-holding companies 
Investment installment companies 
Mortgage companies 
Land banks 
Government lending institutions 


14.55 
0.83 
2.95 
2.44 
0.29 
0.50 
1.01 

12.31 


oroocrRNOs 
naa » 


BETSeesee 


Total 17.81 34.87 


Grand total 124.57 382 .06 





Source: Raymond W. Goldsmith, The Share of Financial Intermediaries in National Wealth and 
National Assets (National Bureau of Economic Research, Occasional Paper 42, 1954), Table 1. Slight 
discrepancies between components and totals are due to rounding. 
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by borrowing from others, largely individuals, on notes secured by a 
junior lien on the property. Under these circumstances a residential 
building boom, such as occurred during the 1920’s, could lead to a 
substantial increase in the interest income of individuals. In the build- 
ing boom that followed World War II, in contrast, institutional inves- 
tors widely adopted the long-term amortized mortgage—with or with- 
out FHA insurance—under which they were often willing to lend 80 
per cent or more of the value of the mortgaged property. In conse- 
quence, even apart from restrictions by the FHA against junior lien 
financing in connection with insured mortgages, the area for such 
financing was greatly reduced. 

The direct participation of individual lenders in financing residential 
construction was further narrowed both by their exclusion from the 
list of eligible lenders in the FHA program of insured mortgages and 
by the stimulation of commercial bank lending in this field. Whereas 
the National Banking Act imposes severe resirictions on the freedom 
of national banks to make conventional mortgage loans, and substan- 
tially similar restrictions exist in many state banking laws, mortgage 
loans insured by the FHA or guaranteed by the Veterans’ Administra- 
tion were generally exempted from these limitations. Residential mort- 
gage loans by commercial banks were thereby greatly stimulated. In 
1926-1930 commercial banks accounted for 14 per cent of home mort- 
gage loans; in 1946-1950 they accounted for 23 per cent of the much 
larger total.™ Between 1935 and 1950 they originated about one-third 
of the total amount of FHA-insured home mortgage loans and about 
one-half of the total amount of FHA-insured loans on rental and coop- 
erative housing projects."® 

The continued growth of the corporate form of doing business and of 
owning property likewise tended to reduce direct individual invest- 
ment in debt obligations. To the extent that ownership of mortgages, 
corporation and government bonds, and other debt passes from in- 
dividuals into the hands of property-managing family corporations, 
financial corporations, and ordinary business enterprises, the interest 
income becomes merged with other corporate receipts and reaches in- 
dividuals in the form of dividends and capital gains. 

Further evidence of the increased importance of financial institu- 





4 See Leo Grebler, The Role of Federal Credit Aids in Residential Construction (National Bureau of 
Economic Research, Occasional Paper 39, 1953), p. 39. 
16 Tbid., p. 38. 
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tions as recipients of interest income is found in Table 10, which com- 
pares the amounts of monetary interest (total interest less “imputed” 
interest as defined below) received by individuals with the amounts 
received by the principal classes of financial institutions, as estimated 
by the Department of Commerce. As a group these institutions in- 
creased their interest receipts from a total roughly equal to that for 
individuals in 1929 to one about 20 per cent greater by 1948 and 41 per 


TABLE 10 


MONETARY INTEREST RECEIVED BY PRINCIPAL TYPES OF FI.- 
NANCIAL INTERMEDIARIES COMPARED WITH MONETARY IN. 
TEREST RECEIVED BY INDIVIDUALS AND NON- 
PROFIT ORGANIZATIONS, 1929-1953 


(Millions of Dollars) 








Federal Mutual Life CorPor® Savings Total for 


inl OR par i - tionsin andloan Credit financial 


4 finance, associa- unions 
banks banks companies oak Siete 





5,228 
4,972 
4,570 
4,105 
3,497 
3,579 
3,483 
3,491 
3,592 
3,445 


noon or en & * & 


-— 


3,521 
3,613 
3,750 
3,688 
3,834 
4,153 
4,601 
5, 148 
5,576 
5,209 


ee ee 
- 
#£8 


LE 


S23 


+ 
oo 
x 


~ 
= 
—) 


1947 
1948 


1949 2,977 6,799 
1950 3,234 : 7,419 
1951 3,626 750 8,359 
1952 4,116 757 875 9,469 
1953 4,610 846 714 1,051 106 10,622 





Source: Office of Business Economics, Dept. of Commerce and National Income Supplement, 1954, 
Survey of Current Business. 
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cent greater by 1953. Life insurance carriers approximately doubled 
their interest receipts between 1929 and 1948, while monetary interest 
receipts of individuals in 1948 were only slightly above the 1929 level. 
The amounts for individuals include the interest receipts of nonprofit 
institutions furnishing services for individuals but exclude interest 
paid by some individuals to other individuals. 

Similar evidence is provided by the growth in the absolute amount 
and relative importance of “imputed interest” as defined and estimated 
by the Department of Commerce. Such interest consists of the amount 
of the net property income of financial institutions that they spend 
to provide services for individuals. Table 11 presents figures for 1929- 
1953 together with those for total personal interest income, which is the 
sum of the monetary and “imputed” interest receipts of individuals 
and nonprofit institutions furnishing services to them. The imputed 
interest portion of total personal interest rose from about 30 per cent 
in 1929 to 44 per cent in 1953, and its absolute amount in the latter 
year was more than 165 per cent greater than in 1929. The monetary 
interest of individuals was 45 per cent above its 1929 level in 1953; 
and its proportion of total personal interest declined from 70 to 56 
per cent between 1929 and 1953. 


TABLE 11 


TOTAL PERSONAL INTEREST, IMPUTED INTEREST PORTION, 
AND MONETARY INTEREST RECEIVED BY INDIVIDUALS 
AND NONPROFIT ORGANIZATIONS, 1929-1953 


(Billions of Dollars) 








Total ; Total 


. Imputed . Imputed 
Year personal toabnaeh Year personal ateennt 
interest interest interest '"T®S* interest 





1929 
1930 
1931 
1932 
1933 
1934 
1935 
1936 
1937 
1938 
1939 
1940 
1941 


1942 
1943 
1944 
1945 
1946 
1947 
1948 
1949 
1950 
1951 
1952 
1953 13. 
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Source: National Income Supplement, 1954, Survey of Current Business. 
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II. THE SMALL PROPORTION OF PERSONAL INTEREST RECEIPTS 
REPORTED ON TAXABLE RETURNS 


1. About one-fourth of individuals’ interest income reported on taxable 
returns in the period 1980-1950 


Although they reflect the general movements of total personal in- 
terest receipts, the annual amounts of interest reported on taxable 
individual income tax returns are also importantly influenced by fac- 
tors other than the movements of total personal interest receipts, such 
as statutory changes in the level of personal exemptions, changes in 
the distribution of income, the relative amounts of taxable and tax- 
exempt interest, etc. It is nevertheless striking to observe that the high 
point in the absolute amount of interest reported on taxable individual 
returns was reached as long ago as 1928. In that year, when only 2.5 
million taxable income tax returns were filed, the amount of interest 
reported on them was $2.0 billion. In 1950, when the number of taxable 
returns was fifteen times as large and the total amount of income on 
them seven times as large, the amount of interest they included was 


TABLE 12 


INTEREST INCOME REPORTED ON TAXABLE 
INDIVIDUAL RETURNS, 1918-1950 








“ ; Interest " Interest 
Interest — as Interest Adj _ 

on percentage on Po percentage 
taxable taxable of ad- 
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Source: Statistics of Income. The figures for total adjusted gross income are not perfectly comparable 
throughout the period. Adjusted gross income is defined in the Internal Revenue Code as gross income 
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only $1.6 billion, or about 20 per cent less than in 1928 (Tables 12 and 
13, Chart 4). 

Not only in 1950, but also in each of the preceding nineteen years, 
the amount of interest reported on taxable individual returns was less 
than that reported in every year between 1923 and 1930. 

Further, whereas interest constituted about 8.6 per cent of the 
aggregate adjusted gross income of taxable individuals in 1928 and 
averaged 9.0 per cent of such income annually between 1918 and 1933, 
its percentage importance had shrunk to 1 per cent by 1943, and has 
remained at about that level in every year since (up to and including 
1950, the latest year for which the figures are available as this is writ- 
ten). 

The amounts reported on taxable income tax returns represented 
only about one-fourth, on the average, of total personal interest re- 
ceipts in 1930-1950 (Table 13 and Chart 5). Prior to 1930 the propor- 
tion was substantially higher. 

In 1918, when the low World War I personal exemptions gave wide 
coverage to the income tax, it had been 66 per cent. After falling steeply 
to 36 per cent by 1921, it rose to 52 per cent in 1924 and then fluc- 
tuated somewhat above 40 per cent during the remainder of the 1920's. 
Then it plunged to reach a low of 19 per cent in 1933 and 1934. A 
gradual rise during the next eight years brought the proportion to 33 
per cent in 1942, after which it fell off again. It was about 25 per cent 
in each of the four years 1947-1950. 





minus allowable trade and business deductions, expenses of travel and lodging in connection with 
employment, reimbursed expenses in connection with employment, deductions attributable to rents and 
royalties, deductions for depreciation and depletion allowable to life tenants or to income beneficiaries 
of property held in trust, and allowable losses from sales of property. Each component of adjusted gross 
income, therefore, is the net income or loss from that source after making the deductions that are 
specifically allowed. A net loss from any source of taxable income constitutes a part of the adjusted gross 
income (or deficit) as well as a net profit (see Statistics of Income, 1948, p. 9). 

For years prior to 1944, when the Bureau of Internai Revenue first revised its basis of tabulating 
and reporting from a net income to an adjusted gross income basis, we constructed the adjusted gross 
income by adding together the separate components as reported in Statistics of Income, but had to ignore 
the amounts of certain negative components of income that were not reported in Statistics of Income 
prior to various years. (1) Prior to 1924, only positive components of income were reported; (2) capital 
losses from assets held more than two years were first reported in 1924; (3) other capital losses were first 
reported in 1926; (4) business and partnership losses were first reported in 1930; (5) rent and royalty 
losses were first reported in 1944; (6) prior year net losses and net operating losses though allowable asa 
deduction in 1922-1931 and 1940-1943, respectively, were not deducted in our estimates of adjusted 
gross income for those years. 

For the years 1927-1931 and 1932-1936, respectively, when Statistics of Income combined non- 
taxable returns with taxable returns for net incomes of $5,00C-10,000, and $5,000-6,000, the various 
components of income on taxable returns were estimated by assuming they constituted the same ratio 
of the total reported of the various components as the ratio of net income on taxable returns to total net 
income reported. 

For interest reported on taxable returns, see note to Table 4. 
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CHART 4 


INTEREST AS A COMPONENT OF ADJUSTED 
GROSS INCOME ON TAXABLE RETURNS, 
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The varying gaps between estimated total individual receipts of in- 
terest and the amounts reported on taxable individual income tax 
returns arise from a number of different causes. Doubtless one of them is 
that the statistical sources and methods for estimating total interest 
receipts of individuals are imperfect. The other principal causes are: 

| a. Changes in the statutory amounts of personal exemptions and 
4 credits 

b. Fluctuations in national and total personal income 

c. Deferred reporting of accrued interest on unredeemed United 
States savings bonds 

d. Changes in tax-exempt interest received by individuals and in 
/ total interest received by nonprofit organizations 
e. Underreporting of interest income for tax purposes 











TABLE 13 


INTEREST REPORTED ON TAXABLE RETURNS COMPARED 
WITH TOTAL PERSONAL MONETARY INTEREST RECEIPTS 














1918-1950 
Interest Interest 
reported reported 
Sates on taxable on on taxable 
Total returns Total returns 
monetary reported as per- monetary reported as per- 
: on taxable P on taxable 
interest cutee centage interest sstuene centage 
of total of total 
monetary monetary 
interest interest 
(billions of dollars) (billions of dollars) 
1918 1.97 1.31 66.4 1933 4.42 0.85 19.2 
1919 2.58 1.40 54.0 1934 4.24 0.81 19.2 
1935 4.04 0.81 20.1 
1919 2.93 1.40 47.6 1936 3.89 0.90 23.2 
1920 3.33 1.48 44.5 1937 3.88 0.88 22.7 
1921 3.52 1.26 35.8 1938 3.82 0.87 22.7 
1922 3.59 1.44 40.2 1939 3.76 0.90 23.9 
1923 3.77 1.87 49.6 1940 3.70 0.99 26.7 
1924 3.90 2.01 61.5 1941 3.67 1.15 31.3 
1925 4.06 1.64 40.5 1942 3.49 1.14 32.7 
1926 4.12 1.76 42.6 1943 3.39 1.10 32.3 
1927 4.30 1.83 42.7 1944 3.45 1.08 31.3 
1928 4.57 2.03 44.5 1945 3.90 1.07 27.4 
1929 4.83 2.02 41.8 1946 4.38 1.23 28.1 
1947 4.87 1.25 25.6 
1929 5.20 2.02 38.8 1948 5.26 1.30 24.8 
1930 4.80 1.71 35.6 1949 5.74 1.51 26.4 
1931 4.84 1.14 23.6 1950 6.24 1.62 25.9 
1932 4.64 1.01 21.8 











Source: For monetary interest, Table 1; for interest reported on taxable returns, Statistics of In- 
come (see note to Table 4). 
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9, Effects of changes in personal exemptions upon interest reported on 
taxable returns 


Statutory changes in the amounts of personal exemptions and credits 
have operated at times to reduce materially, or to add to, the number 
of taxpayers and the amount of income subject to income tax and, 
therefore, the total amount of interest on taxable returns. Persons with 
income under $5,000, which includes those most readily shifted out of or 
into the taxable category by changes in personal exemptions and cred- 
its, generally account for a substantial fraction of all interest reported 
on taxable returns. 

A particularly clear example of the effect on taxable interest of 
raising personal exemptions may be had by comparing the figures for 
1925 with those for 1924. An increase from $2,500 to $3,500 in the 
personal exemption for heads of families and from $1,000 to $1,500 for 
single persons was made for 1925 (and continued through 1931). The 
number of taxable returns filed in 1925 dropped 44 per cent below that 
of 1924, and their aggregate net income dropped 10 per cent, despite a 
rise in total national income in 1925. A substantial amount of interest 
income that had previously been included on taxable returns was now 
exluded. Interest reported by taxpayers with net incomes of $5,000 
or less dropped $521 million below the preceding year despite an in- 
crease of $153 million in the amount reported by the other income 
groups. The larger exemptions and credits also caused a smaller rise 
than would otherwise have occurred in interest income reported on 
taxable returns in the later 1920’s. 


3. Fluctuations in personal and national income affect the amount of in- 
terest reported on taxable returns 


More commonly, the effects of statutory alterations in personal 
exemptions, though important, have been obscured and overshadowed 
by the influence of major changes in the level of business activity and 
national income which have operated to shift large numbers of tax- 
payers out of or into the taxable category by altering the aggregate 
amounts of their incomes from all sources. In consequence, taxpayers 
with unchanged income from interest were often taxable in one year 
and not in another. Deductible capital and other losses, such as were 
widely incurred in 1921 and 1929-1933, converted many incomes con- 
taining substantial interest components into statutory net losses or 
reduced them below the allowance for personal exemptions and credits 
for dependents, while in other years some individuals whose interest 
receipts had previously gone untaxed were brought into the taxable 
group by increases in their income. from other sources. 
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For example, the reduction in taxable interest and total taxable in- 
come brought about by the increase in personal exemptions and credits 
for dependents in 1921 was accentuated by the business recession of 
that year. In consequence of both the change in law and the recession, 
the number of taxable returns fell 35 per cent, total statutory net in- 
come on them 34 per cent, and the amount of interest included on them 
15 per cent below the figures for 1920. With no change in personal ex. 
emptions and credits, an extraordinary volume of capital gains, arising 
largely from the stock market boom, together with a growth of the 
national income, were major factors in causing the statutory net in- 
come of taxable returns in 1925-1928 to rise from $17.5 billion to $21 
billion and the amount of interest income reported on them from $1.6 
billion to $2.0 billion. Again with no change in personal exemptions 
the sharp decline in business activity between 1929 and 1931 was ac- 
companied by a decline from $20.5 billion to $9.3 billion in the total 
net income of taxable returns and from $2.0 billion to $1.1 billion in the 
amount of interest reported on them. The continuing decline in per- 
sonal incomes in 1932 outweighed the effects of the reductions in 
personal exemptions enacted for that year, with the result that total 
net income and interest income on taxable returns continued to decline. 

Apart from the low absolute level of interest income reported on 
taxable individual returns during the 1940’s, the small proportion of the 
reported interest to the total amounts estimated to have been received 
is particularly noteworthy because many millions of persons and bil- 
lions of dollars were added to the income tax rolls in this period by the 
sharply rising levels of total personal income and by reductions in 
personal exemptions. The number of taxable individual income tax 
returns, which had ranged between about 2} and 53 million during the 
1920’s, and 13 to 4 million during the 1930’s, rose above 40 million by 
1943 and even higher thereafter. In these circumstances it is distinctly 
surprising that a material increase failed to take place in the propor- 
tion of total personal interest receipts accounted for on taxable returns. 
Actually, the proportion in the latter part of the 1940’s was little dif- 
ferent from that of 1939 and much lower than in the 1920’s (Table 13, 
Chart 5). 


4. United States savings discount bonds 


One factor in the failure of reported taxable interest receipts in the 
1940’s to recover to earlier levels despite the great increase in private 
holdings of debt was the large investment made by individuals since 
1935 in United States savings bonds, the accruing interest on which 
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need not be reported for taxation, if the holder so chooses, until the 
ponds are redeemed. It is probable that little of the accruing interest 
ig reported until redemption and that a substantial fraction is never 
reported.!7 

Until April 1940, Series A-D bonds were open to purchase by com- 
mercial banks and other corporations as well as by individuals. But 
each purchaser was limited to $7,500 issue price in any one calendar 
year. This restriction had the effect of largely confining subscriptions 
to individuals even before corporations were made ineligible as pur- 
chasers. Purchases of Series E bonds were open only to natural persons 
and were limited at first to $3,750 issue price in any calendar year for 
each purchaser, a limit that was subsequently raised to $10,000 ma- 
turity value and in 1952 to $20,000 maturity value. This series of bonds 
accounted for 61 per cent of the current redemption value of all the 
unmatured savings bonds outstanding December 31, 1952. 

Until replaced in May 1952 by similar issues bearing higher yields, 
Series F and G bonds were continuously open to all subscribers except 
commercial banks in amounts up to $100,000 issue price per year for 
the two issues combined ($50,000 in calendar year 1944), and commer- 
cial banks were also permitted to purchase these bonds during certain 
periods and under certain restrictions. Series G bonds, which were not 
discount bonds unless turned in for redemption in advance of maturity, 
proved far more popular than Series F and were outstanding in the 
amount of $18.4 billion current redemption value at the end of 1952 





17 United States savings bonds were first offered for sale in March 1935. Bonds of Series A~D were 
sold until May 1941, when they were succeeded by Series E, F, and G. Series A-E bonds were 10-year 
discount bonds sold at 75 per cent of maturity value to yield 2.90 per cent per annum if held to maturity 
and lesser yields if redeemed before maturity. Series F bonds were 12-year discount bonds sold at 74 per 
cent of maturity value to yield 2.53 per cent per annum if held to maturity and less if redeemed prior 
‘hereto. 

Series G bonds were twelve-year current income bonds sold at par and bearing interest at 2} per 
cent per annum, but redeemable at the option of the holder in advance of maturity at varying discounts 
from par. Further details may be found in the Treasury Bulletin, December 1944, p. 36, and February 
1946, p. 37. 

A number of changes in the savings bond offerings became effective in May and June 1952: (1) For 
Series E, the over-all interest rate was raised from 2.9 to 3 per cent without changing the offering price 
or maturity value, by shortening the maturity from 10 years to 9 years, 8 months; the redemption 
values were raised for the early and intermediate periods in advance of maturity; and the maximum 
amount purchasable by any person in any single year was raised to $20,000 maturity value. (2) A new 
Series H current income bond was introduced, available for individuals in amounts up to $20,000 ma- 
turity value annually per purchaser, with issue and redemption at par, and with current payment of 
interest cn a graduated scale at rates closely corresponding to the revised E bond scale and the same 
maturity as the revised E bonds. (3) Series F and G bonds were replaced by similar bonds, J and K 
respectively, yielding higher interest rates and eligible for annual purchases up to $200,000 maturity 
value for the two series combined for each purchaser. When Series E bonds began to mature on May 1, 
1951, owners of the matured bonds were offered an extension of maturity of 10 years or an exchange into 
Series G bonds as alternatives to cash redemption. For Series E bonds maturing after April 30, 1952, a 
10-year extension of maturity at an effective interest rate of approximately 3 per cent or an exchange into 
Series K bonds became available. 
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as against $3.8 billion of Series F bonds. Large-scale acquisition of F 
and G bonds by financial institutions and other corporations was 
prevented by the limitation on each subscriber’s annual purchases, 

The accrued discount on all unredeemed United States savings bonds 
held by iidividuals amounted to only $91 million in the aggregate in 
the first six years of the program, 1935-1940. Because sales of the bonds 
expanded after American entrance into World War II and because the 
effective interest rate on each bond rose for every additional six-month 
period that it was held (after the first year), the annual accruals of 
interest (discount) increased substantially thereafter reaching $05 
million in 1950 and $697 million in 1952 (Table 14). In the years 
1945-1950 accruals on savings bonds owned by individuals were equal 
to between 9.7 and 10.5 per cent of the annual totals of monetary 
interest received by individuals and to between 35 and 43 per cent of 
the annual totals of interest from all sources actually reported on tax- 
able individual returns. 


TABLE 14 


ACCRUED DISCOUNT OF INDIVIDUALS ON UNREDEEMED 
UNITED STATES SAVINGS BONDS, BY CALENDAR 
YEARS, 1935-1952 


(Millions of Dollars) 











Year Amount 
1935 — 
1936 3 
1937 9 
1938 16 
1939 25 
1940 38 
1941 57 
1942 78 
1943 141 
1944 253 
1945 377 
1946 440 
1947 490 
1948 554 
1949 591 
1950 605 
1951 682 
1952 697 





Source: Treasury Bulletin and letter from Treasury Department. It was assumed that individuals 
held four-fifths of the amount of Series A-D bonds, all of Series E, and two-thirds of Series F. 
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Nevertheless, the lawful postponement of income tax accounting for 
the accuring interest (discount) on unredeemed United States savings 
bonds offers only a partial explanation for the discrepancy between 
total personal interest and the amounts reported on taxable returns. 
The most extreme assumption that can be made in this direction is that 
all the unredeemed savings bonds were owned by persons filing tax- 
able returns and that the latter reported none of the current accruals 
of interest. If we add these accruals to the amounts of interest included 
on taxable returns, the proportion of total personal interest income 
that can be accounted for is raised somewhat, but only to a level aver- 
aging slightly more than 35 per cent in the 1940’s as against much 
larger proportions in the 1920’s, when the higher personal exemptions 
and credits excluded the vast majority of adults from income tax 
liability. 

Our figures for the proportion of personal monetary or explicit in- 
terest reported on taxable returns for the years prior to 1929 are only 
roughly comparable with those for subsequent years because the two 
series are derived from the work of different investigators who em- 
ployed different procedures. The two series overlap for the year 1929, 
when the proportion based on the Kuznets data was 41.8 per cent and 
that based on the Department of Commerce figures was 38.8 per cent. 
In 1950 the unadjusted proportion based on the Department of Com- 
merce figures was only 25.9 per cent; and when adjusted to include 
all accrued interest on unredeemed savings bonds, it was 35.6 per cent. 
But in 1929 the personal exemptions were such that a married man 
without dependent children paid no income tax unless his net income 
was above $3,500, whereas in 1950 he was taxable if his adjusted gross 
income was as little as $1,333. Clearly the proportion of total interest 
income reported on taxabie returns in recent years was much smaller 
than might reasonably have been expected. 


5. Interest from tax-exempt securities 


To determine how much of the discrepancy between total personal 
interest receipts and the amounts reported on taxable returns was at- 
tributable to interest received on tax-exempt securities, it was necessary 
to estimate individual receipts of such tax-exempt interest (Table 15). 

After rising from $85 million in 1913 to $651. million in 1921, the 
estimated amount of wholly tax-exempt interest received by individ- 
uals fluctuated between $565 and $636 million during the next eleven 
years. Thereafter it declined without interruption to $239 million in 
1947, after which it rose slightly and amounted to $274 million in 1952. 





TABLE 15 


WHOLLY TAX-EXEMPT INTEREST RECEIVED BY INDIVIDUALS, 
1913-1952 
(Millions of Dollars) 








Direct and 
State Wholly exempt guar. fed. 
and securities 
local Direct Postal Federal nominally 
govt. : farm only 
ye federal eavings : 
securities aa : loan partially 
securities deposits nth 
securities exempt 


(1) (2) (3) (4) (5) 








65. 
75. 
86. 
99 


18. 
18. 
18. 
18. 
46. 
70. 
70. 
80 
74. 
60 
56 
56. 
52. 
52. 
52. 
52. 
52. 
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Estimated as follows: 

Col. 1: To George E. Lent’s estimates of individuals’ holdings of state and local government securi- 
ties, as presented in his The Ownership of Tax-Exempt Securities, 1913-1962 (National Bureau of Eco- 
nomic Research, Occasional Paper 47, 1955, Appendix C), averaged for each pair of June 30 figures to 
obtain calendar year estimates, we applied interest rates obtained from estimates of the Dept. of Com- 
merce for 1913, 1922, and 1929-1950. Interest rates for the fiscal years between 1913 and 1922, and 
between 1922 and 1929 were estimated by straight-line interpolation and then averaged to obtain 
calendar year estimates. For 1929-1950, June 30 debt estimates in the Survey of Current Business, 
October 1950, September 1952, were averaged to obtain calendar year figures, and the interest rates 
were derived by using the national income estimates of state and local government interest paid. 

Col. 2: As a result of discussions with Treasury officials and others, we estimated that individuals 
received 80 per cent of the interest from the wholly tax-exempt Treasury bonds outstanding in each 
year and 80 per cent of the interest from 34 per cent Victory Liberty Loan Notes of 1919-1922; and 
that 10 ger cent of individuals’ total holdings of wholly tax-exempt obligations of the federal govern- 
ment, as estimated by Lent (op. cit.), consisted of other Treasury notes and Certificates of Indebtedness. 
The interest on wholly tax-exempt federal bonds was computed from the list of such bonds in the 
Annual Report of the Secretary of the Treasury each year, with adjustments for a part of a year when a 
security was issued or redeemed during the year. For Treasury notes (other than the Victory Liberty 
33's cited above) and Certificates of Indebtedness, also listed in the Annual Report, an average rate of 
interest for the calendar year was computed and applied to calendar year averages of individuals’ 
June 30 estimated holdings. 

Col. 3: Figures for deposits in 1913-1918 were obtained for fiscal years from the Annual Report 
of the Comptroller of the Currency and averaged to obtain calendar year estimates. Interest paid in these 
years was estimated by assuming the same rate between deposits and interest as obtained in calendar 
year 1920. Interest figures for 1919-1929 were taken from Simon Kuznets, National Income and Its 
Composition, 1919-1988, and for 1929-1941 from interest credited to depositors as reported in the 
Annual Report of the Postmaster General and averaged to obtain calendar year estimates. Tax exemption 
was withdrawn from postal savings interest on balances deposited after February 1941, and tax-exempt 
interest in the latter year was estimated at one-sixth of the total interest for the year. 

Col. 4: From wholly tax-exempt Federal Farm Loan Securities. An average interest rate was derived 
for each calendar year from Federal Land Bank and Joint Stock Land Bank bonds outstanding and 
listed in the Annual Report of the Farm Credit Administration and the Annual Report of the Federal 
Farm Loan Board. The interest rates were applied to calendar year averages of Lent’s June 30 estimates 
of Federal Farm Loan holdings by individuals (Lent, op. cit.). Interest rates for 1923-1927 and 1943 were 
estimated by straight-line interpolation and 1944 and 1945 fiscal] years were averaged to obtain a calen- 
dar year rate. 

Col. 5: From nominally only partially tax-exempt direct and guaranteed federal securities. Partially 
tax-exempt direct and guaranteed federal securities consist of bonds issued between 1917 and 1941 by 
the Treasury and several government corporations, the interest on the first $5,000 principal amount of 
which in the hands of any investor is exempt from both normal] and surtaxes, and the interest from any 
additional amounts of which is exempt only from normal tax. 

Interest rates were obtained by calendar years from the list of such securities in the Annual Report 
of the Secretary of the Treasury and were applied to calendar-year averages of Lent’s June 30 estimates of 
individual holdings of partially tax-exempt federal direct and guaranteed securities (Lent, op. cit.) less 
individual holdings of partially tax-exempt United States savings bonds (see note to Table 14). 

For the years 1924-1927 and 1932-1943, both inclusive, we estimated that the amount of partially 
tax-exempt interest enjoying full tax exemption constituted the same proportion of the total as was 
tabulated and reported in Statistics of Income for taxpayers with net incomes of $5,000, $6,000, or 
$10,000 and over in different years. Similar figures were not published for other years. We estimated the 
proportion for the years 1918-1923 by adding several percentage points to the 1924 proportion to allow 
for the fact that for several years after World War I complete exemption from income tax was extended 
for limited periods to the interest on varying principal amounts in excess of $5,000 if the holder sub- 
scribed to additional amounts of certain new Treasury issues. We estimated the proportions for the years 
1928-1931 by straight-line interpolation between the figures for 1927 and 1932 and we assumed a level 
proportion for the years 1944-1950 slightly below that for 1943. 
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For the five years 1918-1923, Treasury obligations were the most 
important source of tax-exempt interest for individuals. Although the 
interest on most of the federal securities issued during World War | 
was nominally exempt only from normal tax, except that interest on 
the first $5,000 principal amount held by any taxpayer was exempt 
also from surtaxes, complete tax exemption was granted for limited 
periods on substantial principal amounts of particular issues to pro- 
mote subscriptions. The Revenue Act of 1921 consolidated the various 
previous temporary tax exemptions of the 4 per cent and 44 per cent 
Liberty bonds into an exemption from normal and surtax of the in- 
terest from an aggregate principal amount of $125,000 for each in- 
vestor for two years after the termination of the war (July 2, 1921) 
and from a principal amount of $50,000 for three additional years. 
Such temporary exemptions were responsible for most of the tax- 
exempt interest derived by individuals from federal securities in 1919- 
1926. The effect of the reduction, and then the expiration, of the tem- 
porary grants of complete exemption was intensified by the substantial 
retirements of federal securities that took place during the twenties. 
By 1931, wholly tax-exempt interest obtained by individuals from 
direct and guaranteed federal obligations, including Postal Savings 
deposits, had shrunk to $100 million as compared with a high of $455 
million in 1920. 

The decline in wholly tax-exempt interest of individuals from federal 
obligations during the 1920’s was partly offset by a steady rise in that 
from state and local government securities. Individual receipts of such 
interest rose each year without interruption from $66 million in 1913 
to $470 million in 1932. In the following fifteen years, however, the 
annual totals fell steadily, largely because of debt retirements and 
declining interest rates, and reached a low of $217 million in 1947. 

The long period of deficit financing by the federal government in the 
1930’s and 1940’s was accompanied, after 1935, by a decline rather than 
an increase in wholly tax-exempt interest income of individuals from 
federal securities. The reasons were that the fully exempt securities 
issued by the Treasury consisted of short-term obligations command- 
ing low interest rates and these securities were mainly purchased by 
institutional investors; some previous issues of fully exempt securities 
were retired; and the federal government ceased issuing tax-exempt 
securities in 1941. 

Since the total of individuals’ receipts of tax-exempt interest fell by 
more than one-half in 1932-1950 while their total monetary interest 
income rose about 34 per cent, it seems clear that the changes in their 
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receipts of tax-exempt interest should have narrowed, not widened, 
the gap between their total receipts of monetary interest and the 
amount reported on taxable returns. 

If we subtract from the annual totals of monetary interest of in- 
dividuals the sum of their tax-exempt interest and accrued interest 
on unredeemed United States savings bonds and compare the remain- 
ders with the annual amounts of interest reported on both taxable and 
nontaxable individual returns, we obtain the percentages in column 
6 of Table 16; and if the comparison is with the amount of interest 
reported only on taxable returns, we obtain the percentages in the last 
column of the table. After these adjustments the proportion of personal 
interest income reported on individual tax returns in the 1940’s remains 
much lower than in the 1920’s, and only moderately higher than in the 
late 1930's. 


6. Underreporting of interest income 


Some part of the recent wide gap between the amount of interest 
that individuals are estimated to receive and the amount they report on 
taxable returns is doubtless due to receipts by children, retired persons, 
and others whose total net incomes are too small to be taxable. Another 
part can be attributed to nonprofit institutions, whose interest income 
in the aggregate has been growing'® and whose interest receipts are 
treated as those of individuals in the national income accounts pub- 
lished by the Department of Commerce. Excluded from taxable in- 
terest each year is the amount received by individuals whose ordinary 
income has been offset by deductible capital and business losses. There 
may also be minor undiscovered statistical offsets. On the other hand, 
the gap is understated in this respect: the figures for total monetary 
interest receipts of individuals are exclusive of interest received by some 
individuals from other individuals. But on the whole the figures strong- 
ly suggest that a significant fraction of the unreported interest esti- 
mated to have been received by individuals represents tax evasion. 

Many taxpayers among the millions added to the income tax rolls 
since World War II doubtless neglected to report interest received on 
savings bank deposits, redeemed savings bonds, building and loan 
association shares, and mortagage and other loans. The sample study 
of tax returns for 1948, conducted by the Bureau of Internal Revenue 





18 Robert Rude, in a preliminary progress report of his study of private nonprofit institutions in the 
United States, found that the endowments (other than physical facilities) of private educational institu- 
tions and private nonprofit hospitals had more than doubled between 1938 and 1947, reaching $2.3 bil- 
lion in the latter year (National Bureau of Economic Research unpublished staff progress reports, 
May 1952). 
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TABLE 16 


TOTAL MONETARY INTEREST RECEIPTS, TAX-EXEMPT In. 

TEREST, ACCRUED INTEREST ON UNITED STATES SAVINGS 

BONDS, AND INTEREST REPORTED ON TAXABLE RETURNS, 
1918-1950 








Accrued Interest not All 
Wholly interest of tax-exempt Interest reported 
Total tax-exempt individuals on or on reported on _interest 
monetary interest unredeemed unredeemed taxableand as per- 
interest received by U.S. savings savings nontaxable centage 
individuals bonds bonds returns _ of col. (4) 
(per cent) 
(Figures other than percentages are for millions of dollars) 
(2) (3) (4) (5) (6) (7) 


taxable 
returns 





286 1,681 83.5 1,306 
2,029 74.0 1,395 


2,376 63.2 1,395 
2,695 60.9 1,482 
1,262 
1,442 
1,869 
2,009 
1,641 


1,834 
2,032 


565 
565 
592 


23s 
= 
Oo i) 


$28 


3,822 
3,761 


3,701 
1941 3,672 
1942 3,494 
1943 3,391 
1944 3,451 
1945 3,903 
1946 4,376 
1947 4,874 
1948 5,256 
1949 5,737 
1950 6,243 





Source: See notes to Tables 1, 14, and 15. 
Col. 1: See notes to Table 1. 

Col. 2: See notes to Table 15. 

Col. 3: See notes to Table 14. 
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under its Audit Control Program, indicated that if all the returns of 
that year had been audited, about 2 million of them would have been 
found to contain errors in interest and that in about 1 million of them 
interest would have been found to be the major source of error.’ 
Unfortunately, the study did not yield measures of the amount of 
understatement of interest or other components of income. The fre- 
quency of interest errors was about four times that of errors in divi- 
dends. In 93 per cent of the returns where interest was the major 
source of error, there was understatement of tax liability. Although re- 
turns with incomes under $7,000 accounted for about four-fifths of 
those showing interest errors, such errors were not unimportant among 
returns with larger incomes. More than half of the estimated increase 
in tax liability estimated to be disclosable on returns in which interest 
was the major source of error was attributed to individuals reporting 
adjusted gross incomes of $7,000 or more or gross receipts from business 
or profession of $25,000 or more. 

The audit sample therefore indicates that understatement and un- 
reporting of personal interest income is not confined to the failure of 
thousands of small depositors and other small interest recipients to 
account fully for trifling individual sums. Also suggesting that sizable 
amounts of interest received from mortgages and federal and corporate 
obligations have been unreported for tax purposes is the fact that these 
sources appear to account for not far from three-quarters of the total 
direct monetary interest paid to individuals in recent years. Selma 
Goldsmith in Volume Thirteen of the National Bureau’s Studies in 
Income and Wealth ({1951], pp. 303-304) presented rough estimates, 
reproduced in Table 17 of the sources of direct interest income of 
individuals in the three years 1944-1946. Excluding tax-exempt in- 
terest, accrued interest on unredeemed United States savings bonds, 
and residuals varying in the different years between 9 and 18 per cent 





Col. 4: Equals col. 1 minus the sum of the figures in cols. 2 and 3. 

Col. 5: See notes to Table 4. For years 1918 through 1930, includes annuities and dividends from 
foreign corporations other than those obtaining more than 50 per cent of their gross income from sources 
in the United States. For 1931, interest on returns with no net income was estimated from ratio obtain- 
ing for taxable returns between interest and other income. For 1941 through 1943, when interest was 
lumped together on 1040a returns with dividends, rents, royalties, and annuities, interest income on 
these returns was estimated by assuming that it constituted the same proportion of the lumped com- 
ponents as on 1040 returns. The interest figures for 1943 include partially tax-exempt dividends. For 
1944 and 1945, when dividends and interest were lumped together in the returns, interest was esti- 
mated to constitute the same proportion of the total of dividends and interest as in 1946. 

Col. 7: See notes to Table 4. 





19 See Marius Farioletti, “Some Results of the First Year’s Audit Control Program of the Bureau 
of Internal Revenue,” National 7J'az Journal, March 1952; also The Audit Control Program (Bureau of 
Internal Revenue, 1951. 
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TABLE 17 


SOURCES OF DIRECT MONETARY INTEREST INCOME OF 
INDIVIDUALS,* 1944-1946 


(Millions of Dollars) 








Source 1944 1945 





Mortgage loans 650 670 
Federal obligations 285 575 
Corporate bonds 730 700 
Commercial bank time deposits 200 300 
Mutual savings bank deposits 200 215 
Savings and loan associations and 

credit unions 165 175 
Unallocated residual 493 450 


Totals 2,723 3,085 





® Individuals include fiduciaries and nonprofit organizations furnishing services to individuals. 

> Including postal savings deposits. 

* Direct monetary interest income of individuals was derived by deducting imputed interest, tax- 
exempt interest, and accrued interest on unredeemed United States savings bonds from the interest 
component of persons! income in the national income statistics of the Department of Commerce. 
These totals vary slightly from our figures used elsewhere in this book primarily because of small 
differences between Mrs. Goldsmith's estimates and ours of tax-exempt interest. 

Source: Selma F. Goldsmith, “Appraisal of Basic Data for Size Distributions,” Studies in Income and 
Wealth, Volume Thirteen (National Bureau of Economic Research, 1951), p. 304. 


that she could not allocate to any of the six principal sources of in- 
terest income, she found that only about one-fourth of the remainder 
of monetary interest paid to individuals represented interest on bank 
deposits and interest or dividends paid by savings and loan associations 
in the three years. Since, for the three years as a whole, only about one- 
third of the total cash interest receipts of individuals, exclusive of 
tax-exempt interest and accrued interest on unredeemed savings bonds, 
was reported on taxable returns, it would seem that significant amounts 
of interest on mortgages and corporate and govenment obligations were 
not reported. 

The figures suggest that greater underreporting of interest income 
developed after the sharp rise in tax rates and taxable incomes during 
World War ITI. In 1941 the amount of interest on taxable returns was 
35 per cent of total monetary interest received by individuals, exclusive 
of tax-exempt interest and accrued interest on unredeemed United 
States savings bonds. This percentage fell by more than 6 percentage 
points to slightly above 29 by 1948 and was a little above 30 in 1950 
(Table 16). 
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When the Treasury Department in September 1952 proposed to re- 
quire banks to file information returns with respect to all payments of 
interest amounting to $100 or more in one year, instead of requiring 
such returns only with respect to interest payments of $600 or more, 
the Wall Street Journal (Feb. 18, 1953) reported that some holders of 
savings accounts were dividing up their accounts among a number of 
banks in order that their interest receipts from any one bank would 
fall short of $100 a year. In December 1952 the Treasury withdrew the 
proposed change in its regulations because of protests from bankers 
that the change would greatly increase their paver work. 


III, TAX REVENUES FROM INTEREST 


1. Tax revenues from interest income in the 1940's far exceeded those of 
previous decades, with a growing proporiton from the lower taxable 
incomes 
Since the personal income tax is levied at the same rates on the total 

from all sources (except for tax-exempt interest and long-term capital 

gains) of an individual’s taxable income, less his personal exemptions 
and allowable deductions, his tax liability is not strictly made up of 
separate amounts attributable to each of the various components of 
his income, such as salaries, dividends, rents, etc.?° Nevertheless it is 
useful for some purposes to act as if this were the situation. In practice 
the additional tax that would be payable on an additional increment of 
income is often taken into account by investors in choosing the char- 
acter and form of new investments—as between bonds and common 
stocks and as between tax-exempt and taxable bonds, for example. 

This aspect receives attention in the next section of this paper. Because 

the composition of taxable incomes of the different income groups and 

the structure of tax rates have varied significantly over a period of 
years, it is also interesting to measure the varying average impact of the 
income tax upon the different components of personal incomes. The 
average effective tax rate for each income group can be calculated by 
dividing its total tax liability by its total adjusted gross income. By 

2% The effective tax rates on some kinds of income have differed from the formal rates in varying 
degree from time to time because of exemptions or credits applicable to particular kinds of income. A 


zero rate, in effect, has been applied without interruption to interest from state and local government 
obligations and from some obligations of the federal government and its instrumentalities. Interest from 





large amounts of securities issued by the federal government and its agencies prior to 1941 was exempt 
from normal tax, although not from surtax. An earned income credit, varying in different years between 
1924 and 1943 (not allewable in 1932 and 1933), reduced the effective tax rates on incomes received as 
compensation for personal services and with different limits, on property income also. Prior to 1936, 
dividend income was exempt from normal tax. Under the Revenue Act of 1954, the first $50 of dividend 
income received by any taxpayer was made wholly exempt from income tax, and a tax credit of 4 per 
cent of the remainder of his dividend income was made applicable against his total tax liability. 
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applying the average effective tax rate so obtained to the amount of 
interest income reported by each income group, a measure of the tax 
liability that can be attributed to the interest component of taxable in- 
comes may be obtained.” In this way interest is not treated as the 
marginal component of income, subject to the highest applicable brack- 
et rates, but treated as if it were distributed among the various 
brackets in the same proportions as all other components of income and 
subject only to the average effective tax rate paid by each person or 
income group on his or its adjusted gross income as a whole. 

Measured in this way, the amount of tax revenue supplied by the 
interest component of taxable individual incomes in 1918-1950 ranged 
from a low of $35 million in 1931 to a high of $288 million in 1950 
(Table 18). In 1918-1920 low personal exemptions, high tax rates, and 
exceptionally high interest rates (the average yield of Moody’s Aaa 
bonds was 5.49 per cent in 1919 and 6.12 per cent in 1920) contributed 
to the relatively large revenues from interest in 1918-1920 ($147, 
$135, $104 million). During 1921-1929 the net effect of more liberal 
personal exemptions, falling tax rates, and sustained or rising interest 
income and total income was to produce a fairly stable plateau of in- 
come tax revenue from interest, averaging $79 million annually. The 
influence of liberalized personal exemptions in 1921 and 1923 is clearly 
reflected in the declines of those years. 

During the 1930’s, largely because of the depressed levels of total 
personal incomes, tax revenues from interest averaged about 36 per 
cent lower than during the twenties, despite the reduction in personal 
exemptions and increased in tax rates that began in 1932. A sharp in- 
crease in income tax revenues from interest took place during the 1940's 
with the steep reductions in personal exemptions, increases in tax 
rates, and growth in the public debt that acccompanied the rearma- 
ment program and World War II. In 1950 the $288 million of income 
tax liability for interest receipts reported on taxable individual returns 
was more than six times that of 1938 and greater than that of any pre- 
vious year. 

Reflecting the progressive structure of effective income tax rates, 
inclusive of the personal exemption provisions, the great bulk of the 
income taxes attributable to interest receipts in all years between 1918 
and 1940 was levied on the upper income groups (Table 19). Tax- 
payers with incomes of $10,000 or more in this period, though receiving 
about 46 per cent on the average of the annual amounts of interest 





21 For some purposes the marginal or highest applicable bracket rate is more significant than the 
average effective tax rate, as is illustrated presently. 





INTEREST AS INCOME AND TAX REVENUE 


TABLE 18 


INTEREST TAX LIABILITY, BY INCOME GROUPS, 1918-1950 
(Thousands of Dollars) 
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Net income classes® 





5-10 


10-25 


25-50 


50-100 


100- 
500 


500 
and 
over 


Total 





909 
534 


796 
774 
987 


1,017 
1,823 
5,871 
16,011 
16,101 
11,422 
10,013 
9,442 
9,918 
4,187 
5,884 
5,445 


299 


874 
694 


991 
900 
943 


1,039 
1,294 
5,185 
11,607 
12,549 
11,462 
11,218 
10,250 
11,238 
6,625 
8,392 
7,828 


804 
756 
662 


801 
254 
595 
412 
1,837 
1,557 
1,227 
1,165 
1,667 
1,645 
1,647 


1,776 
2 ,366 
9,959 
17 ,937 
20,014 
20 ,678 
18,488 
17 ,748 
18,102 
14,970 
18,891 
18,520 


9,295 
7,104 
7,707 
6,536 
7,132 
5,562 
2,595 
1,705 
1,961 
2,190 


2,313 

970 
2,148 
1,615 
4,821 
4,059 
4,376 
4,164 
4,110 
4,131 
4,060 


4,115 
5,022 
13 ,954 
24,483 
31,746 
33 ,553 
34,285 
32,414 
33,541 
28,106 
33,618 
36,819 


17,500 
16 ,626 
17 ,031 
14,934 
15 ,697 
12,548 
9 ,382 
8,388 
9,182 
9,897 


10,402 
7,869 
8,951 
6,456 

10,186 
8,485 

10,889 

11,191 

10,896 

10,759 
9,607 


10,093 
13,661 
28,028 
42,909 
51,050 
55,344 
58,691 
61,081 
60,322 
45,612 
49,050 
58,210 


17 ,322 
17,410 
16 ,682 
14,044 
16 ,473 
12,761 
13,149 
12,984 
13 ,666 
14,900 


15 ,922 
14,364 
12,598 
7,542 
11,469 
9,271 
10,429 
11,167 
12,031 
11,042 
8,682 


9 ,966 
15,291 
23,424 
33 ,922 
39,915 
42,588 
47 ,042 
49 ,862 
49,176 
40,519 
41,747 
54,271 


20 ,367 
20,745 
18,281 
13 ,620 
18,108 
12 ,688 
15,381 
14,066 
15,029 
16,745 


19 ,076 
17,131 
13 ,085 
7,712 
9,789 
8,204 
8,401 
9,842 
12,730 
11,214 
7,110 


8,516 
12,261 
16 ,941 
23 ,296 
29 ,129 
33 ,042 
33 ,855 
38 ,284 
36 ,603 
34,847 
34,921 
44,351 


45,649 
45,849 
26 ,090 
16,595 
24,375 
16 ,372 
21,908 
17 ,422 
19 546 
22,890 


28,715 
27 ,341 
16 ,267 
7,635 
9,577 
9,175 
7,994 
9,603 
15,913 
14,657 
9,123 


11,758 
13,563 
17 ,462 
21,400 
26 ,835 
29 ,232 
28,550 
$4,909 
35,802 
38,739 
36 ,099 
47 ,083 


27 , 347 
21,290 
10,788 
4,878 
8, 33% 
5,614 
7,778 
6,886 
7,809 
% ,988 


{6,383 
18,124 
7,147 
3,329 
2,336 
2,582 
2,725 
2,448 
6,421 
6,498 
4,447 


6.040 
6,065 
5,386 
4,544 
& .317 
Aw, 168 
9,694 
11,297 
11,167 
11,827 
11,519 
15 ,528 


147 ,242 
135 ,307 
104 ,452 
76 ,618 
96 ,258 
72,831 
74,655 
62,755 
68 ,294 
76 ,737 


94,242 
86 ,245 
61,310 
35 ,074 
52 ,066 
45,118 
47 ,266 
50,736 
65 ,556 
61,620 
46 ,606 


54,320 
71,345 
126,211 
196 , 109 
232 ,657 
247 ,489 
251 ,837 
265 , 288 
265 , 867 
225 ,432 
240,122 
288 ,054 





* In 1944 changed to Adjusted Gross Income Classes. 
Source: Statistics of Income. To the estimated tax liability arrived at as described in the text, 
we added the tax liability attributable to the interest component of income from fiduciaries received 
by individuals and beneficiaries. This added tax liability was estimated by the same procedures as those 
described in the notes to Table 4. However, a slight overestimate of tax liability attributable to interest, 
of the order of 1 per cent, results from the small duplication in.the figures. 
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TABLE 19 
SHARES OF INCOME GROUPS IN TOTAL TAX LIABILITY 


ATTRIBUTABLE TO INTEREST, 1918-1950 


(Percentages) 











Net income classes* (thousands of dollars) 
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* In 1944 changed to Adjusted Gross Income Classés. 


Note: The figures do not necessarily add to 100 per cent due to rounding: 





INTEREST AS INCOME AND TAX REVENUE 1293 


reported on taxable returns, paid on the average 89 per cent of the 
annual amounts of income taxes attributable to interest. This extreme 
relationship was moderated after 1940, partly because of a change in 
the distribution of reported interest income in favor of the lower income 
groups and partly because of the relatively greater increases of effective 
tax rates on smaller than larger incomes. Between 1941 and 1949 in- 
clusive, taxpayers with incomes of $10,000 or more reported an annual 
average of 41 per cent of all the interest on taxable returns and paid 
an annual average of 71 per cent of the income taxes attributable to 
interest. 


2. Taxes on interest income declined radically as a share of total income 
taxes, but absorbed one-sixth or more of all interest reported on taxable 
returns in most years since 1942. 


As a proportion of total personal income tax liability, the amount 
contributed by interest income ranged during the twenties from 11.2 
to 8.1 per cent (Table 20). The proportion rose to 15.8 per cent between 
1929 and 1932, when other components of income shrank more dras- 
tically during the economic collapse of that period. Thereafter a sharp 
fall took place that continued with unimportant interruptions for the 
next twelve years. In 1950, income tax liability attributable to interest 
accounted for only 1.6 per cent of total personal income tax liability. 

Income taxes absorbed a relatively small proportion of total interest 
income reported on taxable returns during the two decades 1921-1940, 
the proportion varying between 3.1 and 7.3 per cent (Table 21). It 
rose sharply in the years following 1940, climbing to 11.0 per cent in 
1941, 17.2 in 1942, 21.2 in 1943, 22.9 in 1944, and 23.6 in 1945, after 
which it gradually declined to 15.9 per cent in 1949 and then rose to 
17.8 per cent in 1950. 


IV. INTEREST INCOME AND INTEREST RATES AFTER TAXES 


For the upper income groups, of course, the proportion of interest 
income absorbed by income taxes rose to much higher levels during the 
1940’s, and the marginal rates rose higher still. The effective federal 
tax rates on the first dollar of additional interest or other ordinary in- 
come for a married man with two dependent children and the cited 
amounts of statutory net income from other sources for the years 
1940-1954 are shown in Table 22. To these, in states imposing their 
own income taxes, should be added the marginal tax rates under the 
state laws. 
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* In 1944 changes to Adjusted Gross Income Classes. 
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TABLE 21 


PROPORTION OF INTEREST INCOME ABSORBED BY 
INCOME TAX, TAXABLE RETURNS, 1918-1950 








Percentage of Percentage of 
interest income interest income 
absorbed by absorbed by 
income tax income tax 





1 


wnownwanevy dvd ONSOR OWN 
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6. 
6. 
3. 
3 
3. 
3 
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4 
4 
3 
3. 
5. 
5 
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1. Did low interest rates and high tax rates deter fixed-interest investment? 


The pronounced increase in personal income tax rates in the 1940’s 
took place after interest rates had declined markedly for most of a 
decade and were still falling. In consequence the after-tax or “take- 
home” yields obtainable by individuals from fixed-interest investment 
suffered a sharp compound reduction. In 1929 a married man with two 
dependent children could obtain a marginal after-tax yield of 4.71 per 
cent from a moderate investment in the average of Moody’s Aaa 
corporate bonds if his statutory net income from other sources was 
$5,000, a “take-home” yield of 4.21 per cent, if $25,000, and one of 
3.93 per cent, if $50,000. By 1950 these after-tax yields had shrunk to 
2.16, 1.71, and 1.21, respectively. Similar radical reductions in after-tax 
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TABLE 22 


MARGINAL TAX RATES ON THE FIRST DOLLAR OF ADDITIONAL 
INTEREST OR OTHER ORDINARY INCOME, 1940-1954 


(Married, with Two Dependent Children) 








Statutory net income (thousands of dollars) 





$25 $100 $1,000 





1940* , 34.1 , 66 78.4 
1941 48 68 78 
1942 58 83 

1943° ° . 60. ‘ 88 
1944-1945 62 90 
1946-1947 ‘ ; 56. . 

1948-1949» ‘ ; 33. ° 63.4 

1950> ‘ 34. , 65.5 

1951> ‘ ‘ 39 73 
1952-1953> ‘ . 75 

1954> 38 72 





* Includes Defense Tax. 
> Rates are for joint return. 
© Includes Victory Tax. 
4 Taking into account the statutory limitation of the maximum effective rate to 90 per cent. 
For the years 1946-1954 varying maximum effective rates were imposed on the whole of net income 
that had the effect of reducing the marginal rates applicable to additional income in the uppermost 
income brackets. The levels of net income, for each year, at which the maximum effective rates became 
operative and the resulting effective rates were as follows: 
Year Income 
1946-1947 $2 ,700 ,000 
1948-1949 1,013 ,379 
1950 1,196 ,702 
1951 1,357 ,685 
1952-1953 1,219,400 
1954 1,261,400 


* Assuming no dividend income. 


agesug? 
qowoong 


yields occurred in bonds of lower quality. The shrinkage was even 
greater at higher levels of income (Tabies 23 and 24, Charts 6 and 7). 

Nevertheless, individual investors as a whole added very much 
larger amounts to their holdings of fixed-interest securities during the 
1940’s than in any previous decade. They expanded their ownership 
of federal securities from $10.1 billion at the end of 1939 to $66.3 
billion at the end of 1950.” This increase was some $14 billion greater 
than the entire net federal debt outstanding at the end of 1939, about 
$12 billion greater than the entire amount of net long-term corporate 
debt outstanding on that date, and some $3 billion greater than the 





2 Treasury Bulletin, March 1955, p. 27. 
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* See Table 7. 
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TABLE 24 


MARGINAL AFTER-TAX YIELDS OF MOODY’S Baa CORPORATE 
BONDS, AT SELECTED NET INCOMES, 1919-1954 


(Married, with Two Dependent Children) 








Moody’s Selected net incomes 


Year 
Baa* 





$5,000 $10,000 $25,000 $50,000 $100,000 





1919 
1920 
1921 
1922 
1923 
1924 
1925 
1926 
1927 
1928 


.89 
.79 
93 
.80 
.02 
.69 
18 
78 
.40 
.40 


5.87 
64 
-76 
81 
-26 
94 
52 
i 
.82 
82 


93 2.90 
.58 .28 
-68 34 
.89 12 
.55 -20 
-19 .89 
14 70 
81 -40 
.49 ll 
-49 ll 


125 
20 


35 
-08 
24 
.83 
27 
87 
.48 
.48 
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1929 
1930 
1931 
1932 
1933 
1934 
1935 
1936 
1937 
1938 


-90 
-90 
-62 
.30 
-76 
32 
-75 
-77 


87 
81 
51 
-93 
45 
.07 
52 
.58 
.83 
-57 


25 
19 
71 
.63 
.36 
12 
.66 
.86 
.07 
.70 


-90 
.84 
25 
42 
.35 
36 
97 
-29 
47 
-00 


48 
42 
72 
09 
41 
91 
64 
96 
-06 
.38 
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1939 
1940 
1941 
1942 
1943 
1944 
_ 1945 
1946 
1947 
1948 


-76 
.54 
47 
34 
-94 
71 
47 
41 
56 
.89 


-02 
13 
25 
.80 
.53 
37 
25 
.34 
.42 


42 
45 
.78 
33 
10 
-90 
82 
-96 
-06 
67 


.03 
.62 
39 
73 
47 
36 
33 
.53 
56 
27 
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1949 
1950 
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85 
68 
71 
74 
90 
2.81 


65 
-50 
36 .92 
-20 .88 
27 0.93 
44 0.98 
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* See Table 7. 
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CHART 6 


MARKET YIELDS OF MOODY’S Aaa CORPORATE BONDS 
COMPARED WITH THEIR MARGINAL AFTER-TAX YIELDS 
TO INDIVIDUALS WITH SELECTED NET INCOMES, 
1919-1954 


(married, with two dependent children) 


Per cent 
7 





ome em came = Aaa yields before tax 
———~ $ 5,000 net income 
————=— $ 25,000 net income 
scccsescesee $ 50,000 net income 
$ 100,000 net income 








eA DA he te 8 8 hg a 8 hes 
1919 ‘29 "30 "35 °40 45 "50 54 
Source: Table 23 





sum of all farm, residential, and commercial mortagge debt and state 
and local government securities then outstanding.” It would seem 
that, in the whole complex of forces operating upon the disposition 
of individuals to invest in fixed-interest securities in this period, the 
deterrent influence of sharply reduced take-home yields was less im- 
portant than the sum of the other forces. 

The “other” forces included the exceptional ones arising out of World 
War II. Between mid-1939 and mid-1946, the total amount of adjusted 
demand deposits and currency in circulation (outside of banks) more 
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CHART 7 


MARKET YIELDS OF MOODY’is Baa CORPORATE BONDS 
COMPARED WITH THEIR MARGINAL AFTER-TAX YIELDS 
TO INDIVIDUALS WITH SELECTED NET INCOMES, 
1919-1954 
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than tripled. Extensive government restrictions were imposed upon 
business and consumer spending and upon prices. The result was that 
investors not only had powerful patriotic motives and swollen financial 
resources to invest heavily in United States government securities during 
the war; they had only limited opportunities in the aggregate to spend 
or invest otherwise. 

We may note that in the case of United States savings bonds, which 
accounted for the bulk of individuals’ direct investments in fixed- 
interest securities, the deterrent influence of the combination of high 
income tax rates and relatively low interest rates was weakened by the 
option given holders to postpone tax liability for the accruing interest 
until redemption. Further, the seeming disposition and ability of some 
taxpayers to avoid reporting this and other kirds of interest receipts 
further tended to reduce the restrictive effect, however great or little 
it would otherwise have been, of the higher tax rates and lower interest 
rates upon fixed-interest investment. 

In contrast, despite their close conceptual and competitive relation- 
ship to interest rates, the yields obtainable from common stocks be- 
haved quite differently. Stock yields moved generally upward between 
1936 and 1943, both before and after allowance for personal income 
taxes, though with occasional important reverses, while interest rates 
were falling almost uninterruptedly. The average yield of Moody’s 
list of 125 representative industrial common stocks, which had been 
4 per cent in 1929 and about 3.4 per cent in 1934-1936, rose to 6.4 
per cent by 1942. After declining in the next four years to 3.8 per cent, 
it rose sharply again to reach 6.8 per cent in 1949, whence it declined 
to 6.5 and 6.3 in 1950 and 1951, 5.6 in 1952, 5.5 in 1953, and 4.7 in 1954. 
Except in 1942-1946, common stocks offered a generally widening 
differential in yield over high-grade bonds until after 1950 (Table 25, 
Chart 8). The dividend return from Moody’s representative industrial 
common stock average had been somewhat below the yield of, Moody’s 
Aaa corporate bonds in 1934 and 1935 and only slightly above in 1936. 
By 1949 it was well over twice that offered by the bonds, and the ab- 
solute difference between the average market yields of the stocks and 
bonds had risen to more than four full percentage points. 

Although the high level and steeply graduated scale of the personal 
income tax tended to narrow this difference after allowance for taxes, 
the spread before income tax became sufficiently wide to give common 
stocks a substantial advantage in current after-tax yield. A married 
man with two dependent children could have obtained a marginal 





% Federal Reserve Bulletin, February 1948, p. 197. 
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TABLE 25 


COMPARISON OF MOODY’S Aaa CORPORATE BOND YIEUDS WITH 
DIVIDEND YIELDS OF MOODY’S 125 REPRESENTATIVE 
INDUSTRIAL COMMON STOCKS, 1929-1954 
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® Seven-month average, June-December. 
Source: Moody's Investor’s Service as reprinted in Survey of Current Business. 


after-tax yield of 5.38 per cent in 1950 from a moderate investment in 
Moody’s list of representative industrial common stocks if his income 
from other sources was $5,000; 4.13 per cent, if $25.000; and 3.01 per 
cent, if $50,000. For the same individuals Moody’s Aaa corporate 
bonds offered after-tax yields of only 2.16, 1.71, and 1.21 per cent 
respectively. 

Under these circumstances, taxable fixed-interest securities doubtless 
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CHART 8 


COMPARISON OF MOODY’S Aaa BOND YIELDS WITH 
DIVIDEND YIELDS OF MOODY’S 125 REPRESENTATIVE 
INDUSTRIAL COMMON STOCKS, 1929-1954 


Per cent 
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became relatively less attractive to many individuals than competing 
kinds of investments, such as common stocks, rental real estate, owner- 
occupied houses, life insurance, tax-exempt securities, etc. For many 
persons who were heavily dependent upon investment income to meet 
their living expenses or who were for other reasons highly sensitive and 
responsive to the current rate of return on their investments, the 











1304 AMERICAN STATISTICAL ASSOCIATION JOURNAL, DECEMBER 1955 


readiest of the alternatives to fixed-interest securities lay in common 
stocks. For some, rental real estate, such as commercial buildings and 
apartment houses, provided an attractive alternative because such 
properties—in compensation for their lesser marketability, greater 
risk, and greater need for personal supervision—commonly offer a higher 
direct return than marketable securities, as well as the important added 
attraction that a part of the current cash income from them is not 
taxable, being offset by an allowance for depreciation—an allowance 
which, it is usually hoped, will prove greater than the actual decline 
of the market value of the property. Still other persons, less concerned 
with current cash income or possessing special information or talent, 
were drawn to investments that promised rewards in forms enjoying 
lighter taxation, such as capital gains, oil royalties, life insurance, etc. 

But the very persistence of generally high and rising stock yields in 
the face of much lower and declining bond yields would seem to be con- 
clusive evident -aat investors as a whole, institutional and individual, 
showed a strong preference for bonds as against stocks at any but 
substantial yield differentials. We lack reliable figures for individuals’ 
additions to their holdings of corporate stocks during the 1940’s, but 
the Securities and Exchange Commission has estimated that their 
holdings of corporate stocks, bonds, and other nongovernment securi- 
ties as a whole rose by less than $2 billion during the decade.* 

One factor that should logically have operated to discourage equity 
investment was the tendency of the increases in the level and steepness 
of graduation of tax rates to reduce the net yield advantage after taxes 
of higher-yielding risky investments over lower-yielding safer ones. 
When a taxpayer is subject to a 50 per cent rate on the next increment 
of his income (the 1950 bracket rate for a single taxpayer with net in- 
come a little over $20,000 and for a married couple filing a joint return 
with twice as much), the difference in marginal tax yield between a 
speculative stock returning 7 per cent before income tax and a safe 
bond yielding 3 per cent is reduced from 4 to 2 percentage points. 

In cutting the “take-home” yields to 34 and 14 per cent, respectively 
(assuming an addition to income no larger than the width of one tax 
bracket), the tax leaves their relative relationship unchanged, but it 
nevertheless bears more heavily upon the risky income. A portion of the 
latter may be properly regarded as a kind of insurance premium or 
reserve required by the investor to cover the losses of capital he is 





% Liquid savings estimates of the Securities and Exchange Commission, in National Income Supple- 
ment, 1954, Survey of Current Business, Table 6, p. 166. 
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likely to experience in the long run in connection with such investments. 
Instead of obtaining a continuing 7 per cent yield before taxes, he may 
expect in the long run to average perhaps only 5 per cent. Particularly 
in view of the limited deduction allowed for capital losses, it is from the 
absolute excess of yield after taxes offered by the risky over the safe se- 
curity (and from possible capital gains on other investments) that the 
investor can hope to obtain the funds to make good his capital losses, 
not from the relative yield advantage. By reducing this absolute margin, 
rising or high rates of income tax, particularly if accompanied by severe 
limitations on the allowance for capital losses, may logically be ex- 
pected to influence investors to favor safer securities or, what amounts 
to the same thing, to insist upon compensatory increases in the before- 
tax yield differentials offered by risky over safe securities. 

To take an extreme illustration, consider an investor subject to a 
tax of 90 per cent on his next increment of income. If he choooses a 
risky investment with a market yield of 7 per cent over a safe one 
yielding 3 per cent, his after-tax yield will continue to be 23 times as 
large on the former as on the latter, but the absolute difference in 
after-tax yields will be only 0.4 per cent—a margin that could hardly 
provide significant reserves against capital losses. 

Another portion of the market yield differential offered by risky 
over safe investments may be viewed as the compensation of the ven- 
turesome investor for the service of assuming unpopular risks and 
uncertainties. Although the supply of this service may also be respon- 
sive to the relationship of the yields of risky and safe investments, apart 
from their absolute levels, it seems unreasonable to suppose it is not 
significantly responsive to the absolute rate of “take-home” compensa- 
tion offered for it. To use our previous hypothetical example, a gross 
advantage in after-tax yield of less than 0.4 per cent (to allow something 
for reserves against possible capital losses) could not reasonably be 
expected to attract as much venturesome investment as a bigger ab- 
solute differential in yield. 

Finally, there is the difficulty of recouping capital losses from or- 
dinary income when the latter is taxed at high rates and when only 
severely limited offsets are allowed against it for capital losses. If the 
risk of loss that presumably inheres in a stock yielding 7 per cent at a 
time when good bonds are yielding 3 per cent should materialize, and 
the investor sold some or all of his shares for less than he paid for 
them, he would need to earn and save considerably more ordinary 
income than the amount of his loss to replace the latter, unless he 
should be fortunate enough to obtain offsetting capital gains. For 
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example, if a married couple filing joint returns in 1950 sustained a long- 
term capital loss of $28,000, and this sum was equal to their average 
annual income, they would need to increase their incomes by a mini- 
mum of nearly $40,000 in the aggregate during the next ten years and 
to save the whole increase after income taxes on it to recoup their 
capital loss out of ordinary income without reducing their consumption 
standards (assuming 1950 tax law, including a capital loss allowance 
against ordinary income of $1,000 each year for six years). If their 
average annual income approximated $50,000, and this was also the 
amount of their capital loss, they would require additional ordinary 
income aggregating about $100,000 in the next ten years to recoup their 
capital loss in similar fashion. Unless an investor believes he has good 
possibilites of offsetting possible capital losses by future capital gains, 
the effect of the foregoing kind of calculation is to accentuate in his 
mind the importance of safety of principal as against yield. 

The fact that relatively high and generally rising stock yields per- 
sisted in the face of falling interest rates on high-grade obligations 
during the fifteen years ended in 1950 is doubtless attributable in vary- 
ing degree to other influences as well as to tax considerations. Never- 
theless, it seems reasonable to infer from the prevailing yields that in- 
vestors made substantia! allowance for the heavy personal income taxes 
on dividends and the limited allowance for capital losses in their ap- 
praisal of common stocks and were content during this period to accept 
a greatly reduced after-tax rate of return, relative to both previous 
levels and to current yields on equity investments, for the safety of 
high-grade fixed-interest securities. 

The foregoing considerations apply most directly to investments 
that promise their rewards mainly in the form of more or less regular 
incomes. But a considerable proportion of risky investments are ex- 
pected to produce much of or all their yield in the form of capital gains, 
which may be loosely defined as the profits obtained by selling stocks, 
bonds, land, or other property not a part of the seller’s stock in trade 
for more than they cost him. Such gains are taxed at very much lower 
rates than ordinary income in the United States (one-half or less, pro- 
vided the property is held more than six months) and most other coun- 
ties, and are completely exempted frum income tax in some of them.” 
In consequence, risky investments promising much of their return in 
this form offer investors the possibility of “take-home” yields that are 





% See Lawrence H. Seltzer, with the assistance of Selma F. Goldsmith and M. Slade Kendrick: 
The Nature and Tax Treatment of Capital Gains and Losses (National Bureau of Economic Research» 
1951), Chaps. 1 and 10. 
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substantially larger than those obtainable in the form of current in- 
come from either safe or risky investments. The combination of high 
tax rates on ordinary income and low ones on capital gains therefore 
tends to discourage some forms of risky investment and to encourage 
others.?” 

Astudy by Butters, Thompson, and Bollinger, based upon interviews 
with several hundred active investors representing various levels of 
income and wealth, presents evidence that although the investment 
decisions of a majority of the investors interviewed were not signifi- 
cantly influenced by taxes, the tax structure decreased the willingness 
of the interviewed investors in the aggregate to make equity-type in- 
vestments; that of the investors who were influenced, a greater number 
were moved by tax considerations to more conservative than to more 
venturesome investments; that a significant proportion of the investors 
in the highest income groups were moved to shift to more venturesome 
investments in an effort to obtain the preferential tax treatment af- 
forded capital gains; and that the ability and disposition of the upper- 
most income groups to supply equity capital remained strong in the 
face of high taxes.?® 

When yields on common stocks fail to move in the same direction as 
yields on high-grade bonds, a part of the explanation is doubtless to 
be sought in the changing prospects for capital gains. When these pros- 
pects include seveve uncertainties or a deteriorating business outlook, 
high or rising stock yields may persist in the face of low or declining 
interest rates, as in most of the period between 1929 and 1949. The 
opposite movements may occur when these prospects are regarded more 
favorably by investors, as was presumably the case during the rise in 
stock prices and decline in stock yields in the face of firming interest 
rates in 1949-1954. 

Among the nontax influences (or those indirectly connected with 
taxation) that may well have contributed to the exceptionally wide and 
generally increasing spread between stock and bond yields after 1936 
were the continuing strong growth in the resources of banks and other 
institutional investors and the concentration of their investment de- 
mand upon fixed-interest obligations. The investments of commercial 
banks, savings banks, life insurance companies, and other financial 
institutions are largely restricted by law, tradition, and prudential con- 
siderations to high-grade bonds and mortgages. Net additions to the 

2” Tbid., Chaps. 6 and 7; and J. Keith Butters, Laurence E. ‘Thompson, and Lynn N. Bollinger, 
Effects of Taxation: Investments by Individuals (Graduate School of Business Administration, Harvard 


University, 1953). 
28 Butters, Thompson, and Bollinger, op. cit. 














1308 AMERICAN STATISTICAL ASSOCIATION JOURNAL, DECEMBER 10955 


supply of these, except for government obligations, were small or 
negative during the 1930’s and the first half of the 1940’s. On the 
other hand, the funds available for investment by financial institutions 
increased enormously. 

Commercial banks became insistent and large-scale bidders for 
government and other high-grade securities. Their reserves were swelled 
through huge gold imports in the 1930’s and through large purchases 
of government securities by the Federal Reserve Banks during the 
1940’s. Yet the total loans of all commercial banks in the United States 
amounted to less at the end of 1945 than at the end of 1929 or even 
1924. At the end of 1929 their loans had amounted to about three times 
their holdings of all kinds of securities; at the end of 1945 their total 
loans amounted to only about one-fourth of their holdings of securities. 
With the loan demand restricted and new issues of high-grade corporate 
and state and municipal bonds small, the banks had no major alterna- 
tive outlet for their greatly enlarged funds other than federal govern- 
ment securities during the 1930’s and early 1940’s. Their holdings of 
these rose from $5 billion in 1929 to $91 billion in 1945, when they 
accounted for about three-fourths of the total earning assets of com- 
mercial banks. During the several years following 1945, when the 
demand for business and mortgage loans rose greatly, the banks were 
enabled to obtain additional »2serves to meet this demand by selling 
federal securities at more or less pegged prices to the Federal Reserve 
System. 

Similarly, the life insurance companies and other institutional in- 
vestors were insistent bidders for government and other high-grade 
obligations during the later 1930’s and most of the 1940’s because of 
their rapidly growing resources and the small volume of alternative 
investment opportunities. The 49 life insurance companies reporting 
to the Life Insurance Association of America increased their total 
admitted assets from $21.4 to $43.3 billion between 1935 and 1946. 
Of the $21.9 billion increase, added holdings of United States govern- 
ment securities accounted for $17.2 billion and public utility bonds for 
$3.2 billion. 

For banks, insurance companies, and other financial intermediaries, 
the bulk of the funds that poured in each business day had to be in- 
vested fairly promptly in high-grade fixed-interest obligations at 
whatever yields were obtainable, with little regard to the higher yields 
offered by common stocks and other equity investments. The down- 
ward pressure upon the yields of high-grade fixed-interest securities 
therefore became especially severe. Since individuals apparently also 
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favored fixed-interest obligations during this period, partly for various 
reasons suggested above, the exceptionally wide spread between bond 
and stock yields appears less surprising than at first. 


Vv. FUTURE TRENDS IN PERSONAL INTEREST INCOME 


A comprehensive analysis of probable future trends in personal in- 
terest income would involve detailed consideration of a large number of 
complex influences, In an immediate sense the amount of direct per- 
sonal interest income will be determined, of course, by the volume of 
public and private debt held by individuals and the level and move- 
ment of interest rates. But a host of intricate and interacting causal 
forces will be operating behind these immediate determinants. 

The amount of public debt available for private investment, for 
example, will be determined partly by unpredictable events, such as 
war and business depression, and partly by deliberate decisions, to be 
made from year to year, with respect to the level of federal, state, and 
local government expenditures, the extent to which the latter are to 
be financed by borrowing rather than current taxation, and the pro- 
vision for retiring or otherwise removing debt from the investment 
markets—including such quasi retirements as the important amounts 
acquired each year by the social security and other governmental 
trust funds. 

Changes in the volume of private debt will be influenced by all the 
varied forces that determine the amount of private new capital forma- 
tion. These include the actual and expected state of business, the 
demands upon the physical capacity of public utility and other im- 
portant capital-using industries, the degree to which important tech- 
nological changes occur that create opportunities for profitable large- 
scale investment, the amount of residential construction made profit- 
able by population growth, geographical shifts in population, obsoles- 
cence, and the terms of mortgage financing, etc. 

But the extent to which private new capital formation will lead to 
growth in private debt will in turn depend upon still other factors. 
These include, among others, the volume of funds becoming available 
for investment to financial institutions through net receipts from in- 
dividuals, the amounts becoming available to business corporations 
each year from retained earnings and depreciation charges, and the 
relative popularity of bonds and mortgages as against common and 
preferred stocks among investors. 

Of the total amount of public and private debt made available for 
investment, the amounts to be held directly by individuals will be 
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affected by the competition of banks, insurance companies, pension 
funds and other institutional investors for debt obligations, and by the 
extent to which individuals are moved to use their savings to pay off 
home mortgages and to add to their life insurance policies and annuity 
and pension contracts, etc., instead of acquiring interest-yielding in- 
vestments directly. 

The level and movements of interest rates will be profoundly in- 
fluenced, of course, by the policies of the Federal Reserve System, as 
well as by the myriad other factors operating upon the supply of and 
demand for loanable funds. 

Without attempting a detailed quantitative analysis of the foregoing 
and other forces, it may nevertheless be useful to note broadly a few 
general tendencies. 


1. Long-term uptrend in total of public and private debt 


With only short-lived interruptions the total of net interest-bearing 
public and private debt in the United States has been increasing for 
many years (Table 6). It rose from $106 billion to $474 billion between 
1918 and 1952, or by an average of $11 billion a year. The cumulative 
or compound rate of increase was about 4} per cent a year. The only 
individual years in which the total did not rise were the depression 
years 1931-1933 and 1938, and the year 1946, when the Treasury re- 
tired $20 billion of debt with surplus funds part of which it had ob- 
tained through its Victory Loan only a few months previously. 

The largest single source of increase in 1918-1952 was the expansion 
in the federal debt that accompanied World Wars I and II. Neverthe- 
less the growth of private debt and that of state and local governments 
accounted for 45 per cent of the total debt expansion in this period. 
The aggregate amount of nonfederal debt outstanding declined in 
each of the depression years, 1931-1935 and 1938, ag well as in the 
early war years of 1942-1943, but rose in all the othr years of the 
period. | 


2. Possible decline in net federal debt 


Barring a change in traditional federal budget practice, no large 
increase, if any, is to be expected, except during periods of war or 
business depression, in the net federal debt—the amount in other hands 
than federal trust funds and credit agencies. On the contrary, the 
steady absorption of the federal debt by federal trust funds and credit 
agencies, averaging $2.8 billion annually in 1944-1954, reduces by 4 
like amount the volume of federal securities that would otherwise be 
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available to private investors. Hence, a balanced federal budget in the 
legislative and administrative sense, which includes provision for 
additions to the reserves or the trust funds, may mean in the future a 
reduction of perhaps $3 billion or more a year in the net interest-bearing 
federal debt outstanding in private hands. In this sense the operation 
of the social security and other government trust funds may be re- 
garded as having the same effect as a sinking fund of like magnitude 
for reducing the amount of public debt in private hands. Some who 
fear that this rate of debt reduction would absorb too much of tax- 
payers’ incomes in ordinary times have proposed that Congress seek 
to balance only the cash budget: that is, to raise only enough tax reve- 
nue each year to meet the year’s actual cash expenditures, including 
outpayments by the social security and other trust funds, but not 
additions to their reserves. Others regard the public debt acquired 
by the social security and other trust funds as not extinguished, but 
alive, and as representing the funded portion of the government’s 
growing obligations under these trust funds. By gradually transferring 
the privately held federal debt to the trust funds, the federal govern- 
ment lightens its interest obligations to private holders as its social 
security obligations increase. 


3. Special stimulants aided high rate of private debt expansion after 
World War II 


The net interest-bearing federal debt declined by $24 billion between 
the ends of 1945 and 1953, most of the decline being accounted for by 
the 1946 redemptions noted above. But other borrowers, most notably 
home buyers, business corporations, and state and local governments 
have been increasing their interest-bearing debts at a record rate since 
the end of World War II. Private and state and local government debt 
more than doubled in the eight years ended in 1953. The average an- 
nual increase in short- and long-term nonfederal interest-bearing debt 
in these eight years was $21 billion, and in long-term alone, $14 billion. 
The largest expansion in long-term obligations took place in the non- 
corporate mortgage debt, which rose in the eight years from $32 billion 
to $92 billion. Corporate long-term debt advanced from $38 billion to 
$79 billion; the nonmortgage debt of unincorporated business, farmers, 
and consumers, from $20 to $53 billion; state and local government 
debt, from $13 to $30 billion; and farm mortgages, included above in 
the noncorporate mortgage debt, from $5 to $8 billion. 

Clearly if any such rate of growth were to continue, the total of 
personal interest income, direct and indirect, would tend to rise rapidly 
and substantially—barring a major decline in interest rates. 
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The great expansion of private and state and local government debt 

in 1946-1953 was a natural accompaniment of the enormous upsurge 
in investment that then occurred, for the greatest part of additions to 
interest-bearing debt is normally incurred to help finance new invest- 
ment. Gross private domestic investment (as measured by the Depart- 
ment of Commerce), which was $16.2 billion in 1929, the previous peak 
year, averaged $42.6 billion annually in the eight years ending in 1953, 
and exceeded $50 billion in four of them. 
. The exceptional volume of new investment and debt in the years 
immediately following World War II doubtless owed much to special 
influences, some of which have lost their force. The period benefited 
from the pent-up capital needs accumulated during the depression and 
war for all manner of buildings and durable producers’ and consumers’ 
goods, and for the replenishment of inventories at all levels of produc- 
tion, distribution, and consumption. 

Beginning about the middle of 1950, further stimulus to investment 
and to borrowing was provided by large-scale orders for military ma- 
terials and equipment for the Korean War. A special incentive to a 
wide range of new investment was added by a law permitting business 
enterprises to write off in five years, for tax purposes, varying propor- 
tions of the cost of all projects for capital expenditures that were 
adjudged by the Office of Defense Mobilization to contribute to mili- 
tary defense. 

The then-prevailing high rates of corporate income and excess 
profits taxes, coupled with the expectation that these rates would be 
reduced substantially in a few years, lent added attractiveness to proj- 
ects eligible for rapid amortization. They also gave special encourage- 
ment to borrowing to finance much of the costs, for interest expense 
was deductible from taxable income. Thus, a corporation subject to a 
marginal rate of tax of 70 per cent on any additional income would 
find that its interest cost on money borrowed at 4 per cent was only 
1.20 per cent, after allowance for taxes saved, whereas the dividends 
it would have to offer on additional preferred or common stock were 
not deductible from taxable income. Moreover, because a firm’s bor- 
rowing increased its invested capital credit under the excess profits tax, 
the net effect of borrowing was sometimes to reduce its total tax 
liability by an amount greater than the gross interest cost of the bor- 
rowed funds.*® 





29 Ses Donald C. Miller, “Corporate Taxation and Methods of Corporate Financing,” American 
Economic Review, 42 (December 1952), 839-854. 
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4. Other forces including higher price level favor further debt expansion 


Since most of or all the shortages accumulated during the depression 
and World War II must now be presumed to have been made up, and 
since the bulk of the more urgent and more profitable investment 
projects stimulated by the Korean War and the emergency amortiza- 
tion certificates also must be presun.zd to have been completed, a 
slackening in the growth of nonfederal investment and debt, and even 
occasional temporary reversals in trend, such as have occurred before, 
would not be surprising. 

Nevertheless, substantial average annual additions to the amount of 
nonfederal debt outstanding seem probable for some time to come. In 
the absence of other supporting evidence, or of evidence to the con- 
trary, it would be reasonable to expect such additions because they 
would be consistent with a long-established trend. As previously noted, 
and as may be seen in detail in Table 6, total nonfederal interest-bear- 
ing debt rose in twenty-five of the thirty-five years 1918-1953, and 
nearly all the declines were concentrated in the Great Depression and 
World War II. This long-term trend may be interpreted as reflecting 
the strong tendency of our growing economy to make sizable net addi- 
tions to its stocks of capital equipment and other goods in all tolerably 
prosperous years and, because of a variety of faciors in its institutional 
organization, to finance a considerable fraction of the value of such 
additions by borrowing. Nothing in the recent behavior of the economy 
indicates any imminent weakening in these long-run tendencies. On the 
contrary, various specific considerations suggest continuing strength. 

We have noted that noncorporate residential and commercial mort- 
gage debt rose by $60 billion in 1945-1953 and constituted the most 
important component of long-term debt expansion in that period. The 
availability of FHA and VA mortgages, with their potent attractions 
of small down payments and long amortization periods for borrowers, 
and of mortgage insurance or guaranty for lenders, was of outstanding 
assistance in this expansion. These powerful aids to residential con- 
struction and to the growth of mortgage debt are scheduled to con- 
tinue. In legislation enacted in 1954, the minimum down payment 
under FHA-insured mortgages was reduced to 5 per cent of the first 
$9,000 of purchase price for new houses and 10 per cent for old houses, 
plus 25 per cent of any excess for both new and old houses, subject to 
a ceiling for insured mortgages of $20,000 on one- and two-family 
homes, $27,500 for three-family homes, and $35,000 on four-family 
homes. 

Other forces besides the ease of finance promise to sustain the de- 
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mand for new dwellings (and the creation of mortgages) at a relatively 
high level, though not necessarily or uninterruptedly at the record 
levels of recent years. These include the high rate of population growth, 
the continuing geographical shifts of our population to the Northwest 
and the South, and the country-wide shift of population from the in- 
teriors to the outskirts or suburbs of larger cities and towns. 

The same population factors, accentuated by rising standards of 
consumption, are creating enlarged demands upon state and local 
governments for public works—schools, streets, highways, sewage and 
water systems, etc. A considerable part of such expenditures is com- 
monly financed by borrowing. The growth and movement of population 
is similarly enlarging the requirements for new facilities of public 
utility enterprises, which characteristically finance a substantial pro- 
portion of additions to their invested capital by borrowing. Electric 
light and power, gas, telephone, and pipeline, and other public utility 
corporations accounted for nearly one-half, and, together with the 
railroads, for 77 per cent, of the entire $36 billion par value of corpora- 
tion bonds outstanding in the United States at the beginning of 1951." 
State and local government interest-bearing debt held outside of gov- 
ernmental trust and sinking funds amounted to $24 billion at that time 
and to $31 billion at the end of 1953. 

For industrial corporations as well as public utility and railroad 
enterprises, the rate and character of technological change will also 
influence importantly the volume of new investment, and, therefore, of 
debt creation. Two sources of change that may conceivably require 
heavy investment in the relatively near future are the application of 
atomic energy to civilian uses and the broadening application of 
electronic machinery to industrial processes. Expressly intended to 
stimulate new investment and speedier replacement of old buildings 
and equipment were the optional provisions for the more rapid writing- 
off of new buildings and equipment enacted in the Revenue Code of 
1954. 

Consumer installment credit, stimulated by the increasing variety of 
popular durable consumer goods produced and by the still-growing 
market for consumer paper among banks and specialized consumer 
finance companies, which hold the bulk of it, promises to expand at 
least as rapidly as disposable personal income. The amount outstanding 
fell from $3.2 billion to $2.5 billion in 1942-1945, when the output of 
automobiles and other durable consumer goods was severely restricted 
by wartime controls, but it increased nearly ninefold in the next nine 





80 W. Braddock Hickman, The Volume of Corporate Bond Financing since 1900 (National Bureau of 
Economic Research, 1953), Table A-1, pp. 250 ff. 
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years. The $22.2 billion outstanding at the end of 1953 equaled 8.9 per 
cent of total disposable personal income in that year as compared with 
5.4 per cent in 1939. 

Because of the rise in the price level since 1939—retail prices ap- 
proximately doubled in 1939-1953—the dollar amounts of increases in 
the volume of outstanding debt will tend to be much greater than would 
otherwise have been the case. The upward revaluation of durable old 
assets, such as houses and other buildings, will tend to bring about an 
enlargement of the debts outstanding against them, particularly as 
they change hands; and the higher prices of new assets, assuming other 
things equal, including an unchanged proportion of borrowing against 
their costs, will lead to larger annual additions to the dollar volume of 
debt that in the past. Houses that were formerly built and sold for 
$6,000 and mortgaged for perhaps $4,500, now cost perhaps $12,000 to 
build and, with FHA assistance, command mortgages as large as 
$10,800 each. Thousands of old houses that still carry relatively modest 
mortgages issued during or shortly after the Great Depression will 
gradually pass into the hands of new owners at greatly increased 
valuations and probably will become subject to much larger mortgage 
debts. 

As cities and states build additional schools and highways and make 
other extensions of their physical plants to meet the requirements of an 
expanding and shifting population and of higher standards for public 
services, the cost of the new facilities in dollars per physical unit will 
tend to be a good deal more than in the prewar period, perhaps two or 
more times as much (assuming no significant cost saving through 
technological advances); and, with an unchanged proportion of debt 
financing, the new facilities would entail a correspondingly greater 
amount of new debt. Maturing state and local government debt, in- 
curred when prices were lower, will tend for this reason, other things 
being equal, to be less than the new debt. 

In the same way, the transmission and distribution lines, telephone 
cables, pipelines, and buildings and machinery required by public | 
utilities to meet growth and regional shifts in demand for their services 
will all tend to cost more than formerly per physical unit, and will 
therefore tend to cause greater additions to debt than the same rate of 
physical growth previously occasioned. In some instances it is likely 
that current depreciation charges based upon historical cost will be 
found inadequate, among industrial as well as public utility enter- 
prises, to provide for the eventual replacement of various types of 
capital assets, with the result that greater resort will have to be had to 
retained earnings and new issues of securities, including debt issues. 
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At the same time the borrowing power of many enterprises will be 
enhanced because their old debts will constitute a reduced fraction of 
the present value of their assets, and their debt charges, a reduced 
fraction of their current earning power. 





5. Increasing demand for debt securities by financial institutions 


The growth of debt is promoted not only by the existence of attrac- 
tive investment opportunities for borrowers but also by increases in the 
power and disposition of lenders to lend. Financial institutions rake an 
important contribution in this respect by mobilizing the savings of 
thousands of individuals and assuming the responsibility and risks of 
investing them. Favorable to a continuing growth in debt, therefore, is 
the expanding volume of both voluntary and contractual or compulsory 
saving and investing by individuals through organizations whose in- 
vestments characteristically consist mainly of fixed-interest obligations. 

A major example is life insurance. As estimated by the Institute of 
Life Insurance, the total admitted assets of life insurance companies, 
largely bonds and mortgages, rose by approximately $40 billion, or 
more than 100 per cent, between 1943 and 1953, and have been growing 
in the last few years by more than $5 billion a year. Some, though not 
all, of this increase is doubtless to be ascribed to the rise in the general 
price level and in the level of wages and salaries, which encouraged 
individuals to take out larger insurance policies; but the dollar volume 
of premium receipts and of funds becoming available for investment is 
now geared to the higher price level. 

The recent and continuing growth of private pension funds in busi- 
ness is another example. The following figures, taken from a study pre- 
pared for the Joint Congressional Committee on the Economic Report 
by the National Planning Association (Pensions in the United States, 
1952), indicate the scale of the recent expansion in private pension 

plans: 
ESTIMATED NUMBER OF PRIVATE PENSION PLANS 
AND NUMBER OF PERSONS COVERED 











Number of persons 
Year Number of plans covered 
(millions) 

















720 2.4 

1935 1,090 2.6 
1940 1,965 3.7 
1945 7,425 5.6 
1950 12,330 8.6 
14,000 9.6 
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In the same study it was estimated that employers and employees 
contributed over $2 billion in 1951 to private pension plans, from which 
perhaps $300 million was paid out; that contributions under govern- 
ment programs, other than the old-age and survivors insurance trust 
fund and other than under programs for the armed forces, amounted 
to $2.1 billion, with benefit payments approximating $1 billion; and 
that the net increase in the old-age and survivors insurance trust fund 
in 1952 approximated $2 billion. Ida C. Merriam, of the Social Security 
Administration, estimated that employer contributions to private pen- 
sion plans in 1954 were at the rate of about $2.3 billion a year, employee 
contributions, $0.5 billion, current annual additions to private pension 
fund reserves, about $2.4 billion, and the accumulated reserves, about 
$17 billion.*! Although sizable amounts of the private pension fund 
reserves may come to be invested ir. equity ownership of rental proper- 
ties and in common and preferred stocks, it seems probable that a large 
part of these reserves, as well as all the reserves of public pension plans, 
will be invested in interest-bearing obligations. At the end of 1954, 
corporate and government bonds comprised 73 percent of the non- 
cash assets of all uninsured pension funds of U. S. corporations other 
than banks, insurance companies, and railroads; common stocks, 19 
percent; and preferred stocks and other assets, roughly 4 percent each 
(S.E.C. Release no. 1335, Oct. 12, 1955). 

The widespread adoption of the practice of providing in mortgage 
contracts for monthly installment payments on the principal, relatively 
rare until the establishment of the FHA, is likewise producing a large 
amount of contractual saving by individuals, most of it adding to the 
inflow of funds to institutions and to the pressure upon them to seek 
replacement investments. Not all of these installment principal pay- 
ments represent net saving in an economic sense, of course, because de- 
preciation and obsolescence of the buildings must be allowed for. But 
the equal monthly amortization payments, after the first several 
years, are usually larger than the concurrent loss in the economic value 
of the property, even when the mortgage loans are for as long as thirty 
years. Moreover, the amount of funds flowing to lenders is determined 
by the gross, not the net, saving incorporated in installment payments 
of principal. Because these principal repayments reduce the volume of 
outstanding debt by the same amounts that they supply funds for new 
debt, they obviously do not contribute directly to an expansion of debt. 
On the contrary, by themselves, they constitute a major kind of debt 
reduction. But the daily receipt of huge amounts of such payments by 





"In a paper entitled “Social Security Programs and Economic Stability,” presented at the Uni- 
versitiee-National Bureau of Economic Research Conference on policies to combat depressions, May 15, 
1954, 
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TABLE 26 


INVESTMENTS OF INDIVIDUALS IN SAVINGS ACCOUNTS, - 
UNITED STATES SAVINGS BONDS, AND LIFE 
INSURANCE RESERVES, 1920-1953 


(Millions of Dollars) 








Savings accounts 





Mutual 
S&LA’s* savings 
banks? 


Com- 
mercial 
banks® 


Postal 
savings® 


Savings 
bonds 
U. 8. 

govern- 
ment® 


Reserves 
of life 
insurance 
com- 
panies! 





4,806 
5,541 
5,985 
6,484 
6,912 


7,349 
7,799 
8,352 


13 ,332 


15 ,332 
16 ,813 
17,744 
18,385 
19 ,269 


20 ,002 
20 ,880 
22 ,578 
1953® 24,345 


10,546 
11,079 
12,289 
13,656 
15,044 


14,865 


15,403 
15,523 
16 ,056 
19,001 
23 ,871 


29 ,929 
33 ,447 


35 ,200 
36 ,592 
39 ,331 
42,001 


166 
148 
135 
135 


Bass 
NoOnww 


761 
652 
730 
373 
411 


376 
356 
245 

95 


Ib 


— 
- -— 


153 
475 
964 
442 
900 
800 
400 
,400 
700 
200 
900 
200 
200 
800 
300 


SSSEE SFGan 


- 2-2 2 © 


3& 


49 ,200 
49 ,300 


5,488 
5,893 
6,360 
6,981 
7,706 


8,592 
9,594 
10 ,648 
11,782 
12,801 


13,690 
14,293 
14,319 
14,613 
15,687 


17 ,203 
18,736 
20,181 
21,512 
23 ,024 


24,663 
26 ,592 
28,734 
31,365 
34,212 


37 ,509 
40,713 
43 ,820 
47 ,139 
50,231 


53,630 
57,140 
61,140 
65,500 


23 ,508 
25,278 
27,709 
30,255 
33 ,363 


36,580 
39 507 
43 ,099 
45 ,823 
47 ,169 


48 , 267 
46,716 
42,551 
41,077 
43 ,039 


45 ,567 
48,418 
51,064 
52,979 
55 ,703 


59,148 
64,079 
75,211 
94,104 

116 ,326 


136 ,048 
147,100 
155 ,734 
162 ,700 
169 ,718 


175 ,445 
182 ,593 
194 ,042 
206 ,435 


11,449 
12,393 





* Investments in savings and loan associations including savings accounts, deposits and invest- 
ment securities. Does not include shares pledged against mortgage loans or investments by United States 
government. Source: Home Loan Bank Board. 

> Time deposits. Source: Comptroller of the Currency, 1920-1927; National Association of Mutual 
Savings Banks, 1928-1947; and Federal Deposit Insurance Corporation (time deposits of individuals, 
partnerships and corporations), 1948 to date. 

® Time deposits of individuals, partnerships and corporations. Source: Comptroller of the Currency, 
1920-1947; and Federal Deposit Insurance Corporation, 1948 to date. 
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institutional lenders exerts a potent, continuing pressure upon the 
latter to seek new loans. This pressure is favorable to freer lending and 
easier borrowing. And it is not unlikely that the widespread use of the 
monthly repayment house mortgage induces many of the home buyers 
to save more than they would save otherwise, for an expanding equity 
in a mortgaged house does not adequately serve all the purposes of 
individuals’ saving—it is not a ready source of liquid funds, for example. 

The increasing amount of saving and indirect individual investment 
that is compulsory under the social security system and most private 
pension plans and that is contractual under life insurance policies 
doubtless takes place to some extent at the expense of individuals’ 
direct investments. But the restricted availability of pension rights as 
a source of liquid funds prior to an individual’s retirement renders 
them an inadequate substitute for direct personal investments. Be- 
cause of their special objectives, the same is generally true, though in 
smaller degree, of life insurance policies. It would not be surprising, 
therefore, if the effect of these institutional developments, as of the 
installment payment home mortgage, is to promote a net increase in 
personal saving and investment. To the extent that such increase takes 
the form of additional deposits in savings or checking accounts with 
banks and shares in building and loan associations, the volume of funds 
seeking fixed-interest investments through financial intermediaries 
would be increased further—barring a radical change in the predomi- 
nant type of investment sought by financial institutions. Finally, unless 
material changes in law are made or large gold discoveries occur, in- 
creases in the country’s supply of money to meet the needs of an ex- 
panding population and output will require substantially equal addi- 
tions to the amount of debt securities purchased by the banking system. 

It is striking to note that individuals raised their current additions 
to their holdings of savings accounts, United States saving bonds, and 
life insurance reserves from an average of $2.6 billion a year in the 
prosperous 1920’s, and $0.9 billion in the depressed 1930’s, to $11.4 
billion a year in the 1940’s, and $9.2 billion a year in the four years 
1950-1953, inclusive (Table 26). 





4 Due depositors: Outstanding principal and accrued interest on certificates of deposits, outstanding 
savings stamps and unclaimed deposits. Source: Post Office Department. 

© Current redemption value of savings held by individuals at year-end: War Savings Securities, 
1920-1928; and United States savings bonds, 1935 to date. Does not include holdings of corporations, 
unincorporated businesses, pension funds, etc. Source: Treasury Department. 

f Accumulations in United States legal reserve life insurance companies include reserves plus divi- 
dends left to accumulate, minus premium notes and policy loans. Source: Institute of Life Insurance. 

© Preliminary estimates. 

Note: Does not include savings accounts in credit unions which amounted to approximately 
$1,500 million at the close of 1953. 

Source: Operating Analysis Division, Home Loan Bank Board. 
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Not all the expanding resources of financial intermediaries will seek 
fixed-interest investments. A small proportion of the funds of life in- 
surance companies, for example, is being invested in income-producing 
real estate and in common stocks. Private pension funds, because they 
are much less subject to statutory restrictions in this regard, may even- 
tually devote a substantial fraction of their resources to equity invest- 
ments. The number and resources of investment trusts designed pri- 
marily for investments in common stocks are increasing. But it appears 
safe to say that the greater part of the aggregate investment resources 
of financial intermediaries will continue to seek fixed-interest securities 
for many years to come. 


6. Debt creation and prosperity 


The considerations reviewed thus far do not prove that the volume 
of interest-bearing debt outstanding will necessarily rise, on the aver- 
age, over the next ten or fifteen years, but they offer ground for believ- 
ing such a rise to be likely. The rise will almost certainly take place if 
our economy avoids serious depression. This is true because a good level 
of business activity, by creating bright prospects, itself stimulates in- 
vestment and aggregate borrowing. It is also true that a substantial 
increase in debt commonly functions as a major stimulus to prosperity. 
It contributes vitally to prosperity by financing important amounts of 
capital expenditures without which a high level of economic activity 
cannot readily be achieved. Prosperity usually requires a large volume 
of capital expenditures both because a sizable portion of our industrial 
plant and labor force is more or less specialized, in the short run, for 
producing buildings, machinery, and other capital goods (or the 
equivalent in military goods), and because our population desires not 
to spend all its income for perishable consumer goods but to use some 
of it for long-lasting capital and consumer goods and to acquire claims 
against the future—much of them in the legal form of debts. If the 
capital goods segment of our economy is not active, the consumption 
goods industries will also not be able to operate at full throttle, for they 
are geared to make and sell enough of their products for us all, including 
those in the capital goods industries, not only for their own employees 
and stockholders. 

In spite of the well-founded traditional concern about the dangers 
of a large debt both to the debtors and to society at large,** an expand- 
ing volume of debt as our economy grows would seem to be necessary 





8 The dangers of debt obligations arise mainly from their inflexibility, The essential purpose of a 
debt contract from the creditor’s standpoint is to provide a margin of protection for his claims to interest 
income and principal against the unavoidable risks and fluctuations of ownership of tangible property 
or of shares in business enterprises. For this protection he is willing to accept a limited and usually 
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for the full functioning of our economic institutions. By the purchase of 
various kinds of debt securities, both directly and through financial 
institutions, individuals are enabled to satisfy their desire to obtain 
definite and protected claims against the future in return for making 
available a portion of their current incomes to business corporations, 
governments, and others; and the latter, by productive use o7 the bor- 
rowed funds, help to maintain the nation’s incomes and employment at 
a high level in the present, and place themselves in a position to meet 
the future obligations assumed as they mature. So long as the public 
continues to buy additions to its holdings of life insurance, pension and 
annuity contracts, and other fixed claims to money, there will be a de- 
mand by financial intermediaries for additional debt securities—bar- 
ring a radical change in the predominant type of investment sought by 
financial institutions. Similarly, as previously noted, unless we have 
material changes in law or large gold discoveries, increases in the 
country’s supply of money to meet the needs of a growing population 
and output will require substantially equal additions to the amount of 
debt securities held by the commercial banks. 

Direct ownership of land, buildings, machinery, and other tangible 
assets, or of shares of common stock in business enterprises, also pro- 
vides claims on the future. Because such claims are not in fixed dollar 
amounts nor fixed in time, but vary with the incomes, dividend decla- 
rations, and capital values actually realized, downward movements in 
their amounts from time to time are more or less expected and do not 
usually give rise to difficulties as serious as those resulting from defaults 
of fixed-debt contracts. For this reason, the financial organization of 
society may be said to be less vulnerable to adverse developments when 
claims on the future take the form of equity ownership predominantly 
and fixed debts are small. 

As indicated above, however, the prevailing customs and practices 
of our population and our financial institutions with respect to saving 
and investment appear to favor the creation of a growing volume of 
debt. Well-informed lenders may be expected in self-protection to 
strive to limit the total amount and the maturities of each of their 
individual loans to sums judged to be well within the capacity of the 





lower rate of return. But the creditor's secured position is obtained by increasing the risks of the debtor. 
The latter's promises are usually unconditional as to timing, leaving him little or no room for adjusting 
to temporary adversity. Because of the large volume of short-term debt from some business enterprises 
to others, and from business enterprises as a whole to the commercial banks, and because of the general 
tendency of creditors to try to reduce credit in troubled times, defaults can easily become contagious. 
Severe and often fatal difficulties may then be created for many debtors (and their creditors) who would 
have surmounted their troubles if given more time. If the defaults are sufficiently large and widespread, 
they may do severe damage to financial confidence, and thereby impede for a more or less protracted 
period the ready flow of fresh money savings into new investment. 
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borrower, and to the extent that such lending policies are followed 
generally, the risks are lessened. But protection against the principal 
hazards of a large debt structure—panics and prolonged depressions 
that weaken even strong debtors—is more likely to be found in public 
policies designed to prevent or mitigate them, notably monetary poli- 
cies. 


7. Interest Rates 


While the considerations reviewed thus far offer ground for believing 
that the volume of interest-bearing debt is likely to rise for some years 
to come, such a rise need not be accompanied by an increase in direct 
personal interest income. Whether the latter will also rise will depend 
partly on the movements in the level and structure of interest rates and 
partly on the extent to which individuals enlarge or reduce their direct 
holdings of government and corporate bonds, mortgages, and savings 
deposits. Without venturing a strong opinion on the probable direction 
of near- and intermediate-term changes in interest rates, we may note 
that the considerations cited below appear to indicate that a change in 
either direction is likely to be held to moderate proportions. 

We have previously noted that interest rates on new issues of corpo- 
rate and governmental debt securities and market yields on old issues 
fell almost continuously between 1932 and 1946 (Chart 3). Beginning 
in the first quarter of 1946, interest rates began an irregular two-year 
upward movement. A renewed decline in the three following years leit 
long-term rates modestly above their 1946 lows. Then a sharp rise in 
1951 and another in the first half of 1953 brought them to the highest 
levels in ten years. Between the beginning of 1946 and the middle of 
1953 the average market yield of all marketable long-term Treasury 
bonds advanced from 2.21 to 3.09 per cent; that of Moody’s Aaa corpo- 
rate bonds rose from 2.54 to 3.40 per cent. The Treasury Department 
raised the interest rates offered on United States savings bonds sold 
after April 30, 1952, and, for succeeding periods, on the amounts of 
previous issues maturing after that date that are not tendered for early 
cash redemption. 

The hardening.of interest rates was checked and reversed in the 
second half of 1953 as the combined result of a recession in business 
activity and of actions taken by the Federal Reserve authorities to 
increase the free reserves of commercial banks. In December 1954 the 
market yields of long-term federal and high-grade corporate bonds 
were roughly midway between their lows of the spring of 1946 and their 
highs of June 1953. 

It is possible to conjecture that the net rise since 1946 may be only 
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the first leg of a long-term ascending movement. Chart 9, similar to one 
used by Raymond Goldsmith in connection with an unpublished ad- 
dress, depicts the course of yields on high-grade long-term corporate 
bonds for the last century. It will be noted that the twenty-five-year 
downward movement in interest rates that culminated in 1946 had 
been preceded by a twenty-two-year period of rising interest rates, 
1899-1921, and that the latter rise, in turn, had been preceded by a 
thirty-year decline. Empirical parallelism suggests the possibility, 


CHART 9 
A CENTURY OF LONG-TERM INTEREST RATES, 1850-1953 
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though it obviously does not establish, that we began the ascending are 
of long cycle of interest rate movements in 1946 and that the upward 
movement might continue for another decade or more. 

But it would be dangerous to rely heavily upon historical parallelism 
in this connection. Notable changes have occurred in the central bank- 
ing organization of the United States and other countries. These have 
diminished dependence upon gold as a source and regulator of the 
quantity of money and bank credit. The changes have correspondingly 
increased the power and disposition of the central banking authorities 
to deliberately influence interest rates by altering the amount and cost 
of the reserves they make available to commercial banks. 

In the use of their discretionary powers, moreover, central banking 
authorities have been and are still subject to strong pressures of 
opinion in favor of low interest rates. During the Great Depression the 
deliberate promotion and maintenance of low interest rates became an 
avowed objective of many governments, first as an antidepression ex- 
pedient, then as a long-run measure to combat secular unemployment. 
There was a striking shift of concern and emphasis on the part of many 
scholars and public officials from the problems of short-run cyclical 
instability in prices and business to the long-run difficulties of main- 
taining a high level of employment and output. Where depressions were 
once regarded as brief interludes in the steady march of economic prog- 
ress, even to be welcomed in mild doses for their cathartic action upon 
the inefficiencies and wastes that developed during booms, they came 
to be regarded by many as an inveterate tendency of our economy, a 
ceaseless threat that needs to be combatted by all manner of direct and 
rearguard action. Frequent central banking adjustments in both direc- 
tions had commended themselves highly when credit policy was 
viewed as the helmsman steering the economy through a very narrow 
channel between inflation and depression. But when fear of a secular 
tendency toward depression took hold, there was concern that the 
effects of restrictive short-run credit policies might long outlast the 
occasioning circumstances and reinforce this long-run tendency. A 
continuous promotive policy, with low interest rates a prominent fea- 
ture, therefore came to be advocated by some. 

In the changed economic climate of the years 1939-1953, this type of 
thought receded from prominence. The 95 per cent rise in the gross 
national product of the United States (in constant prices) between the 
first and last year of this period (accompanying an increase in the 
civilian labor force of 15 per cent) did much to erase the former fears of 

long-run tendencies toward depression. At the same time, the doubling 
of the price level—the consumer price index rose 92 per cent and the 
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wholesale price index, 120 per cent—again demonstrated the reality of 
the danger of inflation under favoring conditions. Faith in the useful- 
ness of a restrictive monetary policy in appropriate circumstances was 
restored to many who had lost it. 

Nevertheless the Great Depression left an abiding legacy of public 
concern about maintaining a high level of employment, and a readiness 
to use the powers of government to this end—as evidenced by the 
declaration of congressional policy in the Employment Act of 1946. 
Monetary policy in the form of actions designed to avoid or reverse 
undue credit stringency is looked to by many to provide the first and 
most desirable method of defense against incipient business depres- 
sions. It seems reasonable to believe, therefore, that although the 
Federal Reserve authorities will doubtless permit market forces to 
move interest rates upward when loan and investment demands are 
heavy, and will find it necessary to deliberately restrict credit and 
thereby raise interest r:..es from time to time in order to limit specula- 
tive excesses or inflationary developments, they will have powerful 
motives to moderate upward movements in interest rates. 

Moreover, there is some evidence that the financial markets and 
business generally are highly sensitive to the degree of credit restraint 
required under present conditions to produce more than moderate 
absolute advances in interest rates. The greatly enlarged volume of 
marketable securities held by commercial banks and other institutions 
tends to exert an influence in this direction because even small in- 
creases in interest rates cause sizable reductions in the market value of 
their holdings of medium- and long-term bonds. And even moderate 
advances in long-term market yields tend to shrink the volume of 
funds available from private sources for government-sponsored lending 
programs, such as FHA and VA. 

The degree of credit restraint needed to produce a sizable advance in 
interest rates must be ‘sufficient to overcome the depressive influence 
upon market yields of the great and growing institutional demand for 
high-grade obligations previously discussed—a demand that is fed to an 
important degree by the contractual and noncontractual current liquid 
savings of individuals. Under these conditions, it may well be that a 
degree of deliberate credit restraint which causes relatively small abso- 
lute advances in interest rates may be adequate for policy purposes. 
It may be noted that the restrictive monetary policy pursued by the 
Federal Reserve authorities in the early months of 1953, in conjunction 
with the restrictive debt management policy of the Treasury (the 
offering of a long-term bond issue), had, in the words of the January 
1954 Economic Report of the President, “. . . a more potent effect than 
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was generally expected.” Yet the peak average monthly yield of 
Moody’s Aaa corporate bonds (in June 1953) was only slightly more 
than 0.4 per cent higher than eighteen months before; and the 3.40 per 
cent average yield in that peak month was little more than one-half the 
peak levels of 1920 or 1921, and substantially below the lowest average 
annual yield of these bonds in any year between 1919 and 1935 (Table 
6, Chart 3). The moderateness of the absolute rise in interest rates in 
1951-1953 is the more impressive because the three-year period was one 
of new high records in the volume of private investment and borrowing, 

Illustrating the sensitivity of long-term interest rates on the down- 
side to a softening in the business situation and to credit-easing actions 
of the Federal Reserve authorities was the rapid retraction in the 
twelve months ended in June 1954 of about one-half of the previous 
seven-year net advance in long-term interest rates. The Reserve System 
was quick to purchase about $i billion of Treasury bills in the open 
market in May and June 1953, and to announce, late in June, a reduc- 
tion in the percentage reserve requirements of member banks. These 
two measures added somewhat more than $2 billion to the lendable 
reserves of the banks. 

The improvement in the “quality” of large amounts of debt that re- 
sults from better information about borrowers, greater mobility of in- 
vestment funds, FHA insurance, and VA guarantees also operates to 
reduce effective interest rates. Aided by telephone and telegraph com- 
munication and a nationwide network of dealers and brokers in debt 
securities, and lending agents for insurance companies, the institutional 
demand makes funds available at competitive rates to areas and bor- 
rowers, physically remote from the central money markets, that for- 
merly paid higher rates by reason of their distance. The effective rates 
of interest on obligations not widely known, but otherwise of good 
quality, though rising and falling with other interest rates, are now 
less likely than formerly to make exaggerated responses to tighter 
money markets. Mortgages insured under the FHA or guaranteed by 
the Veterans’ Administration are not limited to a purely local market; 
and even uninsured mortgages, wherever issued, may tap through local 
lending agents the resources of insurance companies operating on a 
national scale. The effect of these developments is to promise a large 
and continuing demand for the obligations of lesser-known borrowers 
and to reduce that fraction of the so-called interest rate which actually 
represents compensation for risk, restricted marketability, and re- 
stricted access to lenders. 

On the other hand, although downward movements of interest rates 
are likely during business recessions, both in response to reduced de- 
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mands by borrowers and to deliberate action of the Federal Reserve 
authorities, a decline to the very low levels of the middle 1940’s appears 
unlikely. These levels were reached in unusual conditions. A combina- 
tion of a depression of greater intensity and duration than had been 
previously experienced and huge additions to our banking reserves 
through imports of gold from Europe had driven interest rates down 
with little interruption from 1932 to 1940. Then, during most of the 
1940’s, direct government controls over steel, copper, and other vital 
materials restricted private demands for borrowed funds, while the 
needs of war finance encouraged an easy-money policy on the part of 
the Federal Reserve System and the Treasury. In the absence of a 
major war or a severe and prolonged depression, it appears unlikely 
either that Federal Reserve policy would seek a similar degree of 
monetary ease or that other forces would bring it about. Supporting 
the likelihood of firmer interest rates in the intermediate future than 
the levels that obtained during the middle 1940’s, at least, is the greatly 
enlarged demand for capital funds by industry, state and local govern- 
ments, the housing market, and foreign countries. 


8. Recent upturn in direct monetary interest income of individuals likely 
to continue, but to lag behind growth in total personal interest income, 
direct and indirect 


Even if greater changes in interest rates should occur than those here 
contemplated, they would affect the total of individuals’ monetary 
interest income only gradually and far less than proportionally in the 
near- and intermediate-term future. One reason for this is that the 
average rate of interest received by individual investors on their total 
holdings naturally moves more slowly than the market rate because 
the new rates are obtained only on the new components of investors’ 
total holdings. A second reason is that a considerable fraction of ind* 
viduals’ interest-yielding investments is in forms that are relatively 
slow to respond to changes in market rates of interest. At the end of 
1953, $91.6 billion of individuals’ interest-bearing investments con- 
sisted of savings accounts with commercial banks, mutual savings 
banks, savings and loan associations, and the postal savings system 
(Table 26). An additional $49.3 billion was in United States savings 
bonds. Mortgages accounted for about $23 billion.* If we assume that 
individuals at the end of 1953 owned approximately the same propor- 





% Federal Reserve Bulletin, June 1954, p. 629. The estimate is a residual for individuals “and others” 
derived by subtracting from the estimated total mortgage debt outstanding the amounts held by com- 
mercial banks, trust companies, mutual savings banks, life insurance companies, building and loan 
associations, HOLC, FNMH, and VA. Some mortgages held by other federai agencies and by business 
corporations are included in the holdings ascribed to “individuals and others.” 
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tions of outstanding marketable bonds of all kinds as in Goldsmith’s 
estimates for 1949, such securities constituted 44 per cent of their total 
interest-bearing investments 

With the prospect of a further expansion in interest-bearing debt, 
and of an average interest rate not more than moderately different from 
the prevailing one, personal monetary interest income is likely to in- 
crease in the near- and intermediate-term future. But the growth in 
individuals’ direct monetary receipts of interest is likely to lag behind 
that in total personal interest income. 

Between 1943 and 1953, the direct monetary interest income of indi- 
viduals and nonprofit organizations rose each year (Table 11). The 
annual amount moved up from $3.4 to $7.5 billion during the decade. 
The increase in total personal interest income, monetary and imputed, 
as measured by the Department of Commerce, was considerably greater 
absolutely, and somewhat greater proportionally, the annual amount 
rising from $5.8 to $13.5 billion. The latter figures, as previously noted, 
include not only the direct monetary interest received by individuals 
and nonprofit organizations, but also the value of services performed 
for them by financial institutions, the value being measured by the 
amount of the latter’s net property income expended to provide such 
services. 

Individuals will doubtless acquire directly some portion of the con- 
tinuing increases in the total of mortgage obligations, public utility and 
other corporate bonds, and state and local obligations, although finan- 
cial institutions may be expected to be the principal purchasers. A 
further increase in individuals’ interest receipts will tend to occur as 
portions of their existing holdings of bonds that were acquired when 
interest rates were lower mature and the proceeds are reinvested—un- 
less interest rates should lose their net rise of the last several years. 
Individuals would also be likely to enlarge their holdings of’ United 
States government securities, possibly at the expense of equity invest- 
ments or savings deposits, if the Treasury were to refund with long- 
term bonds any substantial amounts of the large short-term federal 
debt now held mainly by banks, other financial institutions, and busi- 
ness corporations. , 

But several influences will tend to moderate the growth of individuals’ 
direct monetary interest receipts in the near- and intermediate-term 
future. One of these is the volume of home mortgage repayments. For 
millions of relatively recent home purchasers, earlier-than-scheduled 
repayments of installments on the principal of their mortgage loans will 
offer a more attractive rate of return (in the form of reduced interest 
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charges) than the alternative investments open to them, and will be 
preferred on other grounds as well. The increase in home-mortgage 
indebtedness outstanding in recent years has been accompanied by a 
substantial and rising volume of “apparent” retirements—“apparent” 
because the figures include old mortgages paid off w’’ 1 the proceeds of 
new mortgage loans as well as other full payments, partial prepay- 
ments, and regular amortization. The figures for recent years are as 
follows :* 


MORTGAGE LENDING ON 1- TO 4-FAMILY NONFARM HOUSES 
(In Billions of Dollars) 








New Apparent Increase Outstanding 
Year retire- in out- at end 
ments standings of year 





1949 ‘ : , 37.5 
1950 ; ‘ ‘ 45.1 
1951 . ; ° 51.9 
1952 . ‘ . 58.7 
1953 ‘ ‘ ‘ 65.9 





Although the available data do not permit ready separation of repay- 
ments through mortgage refinancing from repayment through other 
means, it is clear that the latter will shortly reach sizable amounts be- 
cause most of the large volume of home mortgages made in recent years 
calls for complete extinction of the debt through monthly payments 
over a period of from fifteen to thirty years. 

Related to the foregoing in some respects is the possibility of heavy 
cash redemptions of United States savings bonds issued in the last year 
of World War II and the several following years, now approaching 
maturity. Although the, Treasury offered a ten-year extension of ma- 
turity at a slightly higher interest rate to all holders of the maturing E 
bonds, and automatically extended the maturity of E bonds not pre- 
sented for cash redemption or exchange, a large proportion of the bonds 





% Operating Analysis Division, Home Loan Bank Board, and Federal Reserve Bulletin. 

The “New Loans Made” are actually recorded nonfarm mortgages of $20,000 or less. At present 
there is no reliable raethod of identifying the relatively small proportion of mortgages over $20,000, but 
these omissions appear to be approximately compensated by small nonresidential loans and junior 
mortgages of less than $20,000 on larger than 1- to 4-family nonfarm houses and nonresidential property. 
Although precise estimates on home mortgage lending are not available, the recording series provides a 
very good gauge of the amount of permanent financing activity for all 1- to 4-family nonfarm homes. 
Prior to 1951 when t} Home Loan Bank Board prepared direct estimates of new loans on 1- to 4-family 
nonfarm houses, the differences in our data, for the years 1949 and 1950, exceeded the old series by 0.8 
and 0.2 billion respectively, about 7 and 1 per cent for these two overlapping years. 
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maturing in 1952 and 1953 was presented for cash redemption. One use 
for the funds so obtained is reduction of a home-mortgage debt. 

Complicating the effect of savings bond redemptions on individuals’ 
cash interest iacome is the fact that many holders will realize interest 
from the bonds for the first time in a legal sense when they turn them in 
for redemption. In other words, because most holders chose not to re- 
port their accruing discount until redemption, the act of reducing their 
interest-yielding assets will temporarily increase their interest income. 

Conceivably, by more generous terms or more aggressive promotional 
efforts, the Treasury could maintain or increase the importance of sav- 
ings bonds as a source of personal interest receipts. It succeeded in 
maintaining the aggregate amount held by individuals at somewhat 
above $49 billion, measured by current redemption value, in 1950-1953, 
a period in which redemptions ran between $5 and $6 billion annually. 
In the same four years, it may be noted, individuals increased their 
savings accounts in banks, building and loan associations, and the 
postal savings system by $21.4 billion (Table 26). 

Continuing to contest with savings bonds and other direct interest- 
yielding investments for the savings of individuals will be such invest- 
ment intermediaries as life insurance companies and pension funds, 
most of whose interest earnings are not paid out as such. In the same 
four-year period, 1950-1953, in which individuals’ holdings of savings 
bonds remained stationary, the reserves of life insurance companies 
rose by $15.3 billion, and those of private pension funds by perhaps 
one-half as much. 

The amounts of interest income reported by individuals for income 
tax purposes may conceivably increase much more than their total 
interest receipts in the intermediate-term future. Because of the im- 
portant degree of underreporting of interest on tax returns, stricter en- 
forcement procedures may result in substantial additions to: the 
amount of personal interest reported. As in Great Britain, such pro- 
cedures might conceivably include provision for the deduction at the 
source and remittance to the Treasury by all important payers of 
interest of an amount equal to the first bracket tax rate on all interest 
payments. Even in the absence of such specific procedures, more 
thorough education of taxpayers and more comprehensive examination 
of tax returns may yield important results. 





CORRIGENDA 


Readers and authors are invited to submit corrections to papers pub- 
lished in any previous issue. These will be published each year, in the 
December issue. 


Anderson, R. L., THE PRoBLEM OF AUTOCORRELATION IN REGRESSION 
Ana.ysis, Vol. 49, No. 265 (March 1954), 113-129. 

On page 119, line 18 should read >>; >>; b:b;S(AX,AX;) instead of 
Di Des bidsS(AX;—AX)). 


Coale, Ansley J., THE PopuLaTION oF THE UNITED SraTsEs IN 1950 
CLASSIFIED BY AGE, Sex, AND Cotor—A Revision or Census Fic- 
urgEs, Vol. 50, No. 269 (March 1955), 16-54. 

On page 19, line 4, read: (with female rates reduced more than the 
male), instead of: (with male rates reduced more than female). 


Coe, Paul F., NoNwuHITE PopuLATION INCREASES IN METROPOLITAN 
Areas, Vol. 50, No. 270 (June 1965), 283-308. 

Mr. Richard L. Forstall of Rand McNally and Company has kindly 
pointed out that, since Laredo, Texas is located in the South, the last 
sentence on page 305 should be corrected to state, “Only one of these 
20 SMA’s was located in the South.” 


Errata, Vol. 49, No. 268 (December 1954), 907. 

John W. Tukey was erroneously listed as a co-author with Laderman 
and Littauer. Line 14 should read Laderman, J., Littauer, S. B., and 
Weiss, Lionel. 


Hildreth, Clifford, Point Estimates or ORDINATES OF CONCAVE 
Functions, Vol. 49, No. 267 (September 1954), 598-619. 


A hastily attempted extension of the main result of this paper can 
be shown to be in error. Fortunately, the main res’ilt and the illustra- 
tive example are unaffected by the error. Deletion of the following 
material will remove references to the false extension without disrupt- 
ing the organization of the article. 

Lines to be deleted: 

p. 604, line 5 starting with “or one might... ” through line 9. 

p. 613, lines 9 through 23. 

p. 615, lines 7 and 8. 

In addition, the following minor errors were not detected in proof- 
reading: 

p. 602, eqn. (2.7), the period at the end of this line should be deleted. 

p. 604, line 2, “x” should read “2.” 

p. 605, line 15, “by” should read “be.” 
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p. 612, third line from bottom, “6” should read “4.” 

p. 613, second line from bottom, “the” should read “and.” 

p. 615, eqn. (4.10), “(X. V. W)” should read “(X, V, W).” 

p. 617, twelfth line from bottom, insert “a” between “of” and “K.” 

p. 618, seventh line from bottom, “e” should read “3,” 

p. 618, fourth line from bottom, the bar above the m in {v™} should 
be deleted. 


Jackson, J. Edward, and Ross, Eleanor L., ExTENDED TABLES For 
Use with THE “G” Trst ror Means, Vol. 50, No. 270 (June 1965), 
416-433. 

On page 416, line 11, read: 

















n= X?—(=5X) . nz X? — (= X) 
i’ instead of 
n(n — 1) n(n — 1) 
On page 418, in the title of Table I, read: 
xX - G,= |X - 
hs tog instead of : 7 a | . 


Kimball, Bradford F., PracticaL APPLICATIONS OF THE THEORY OF 
EXTREME VALUES, Vol. 50, No. 270 (June 1955), 517-528. 
On page 526, line 6, read S* the mode, instead of S* the median. 
On page 528, reference [13], E. J. Gumbel’s name is misspelled. 


Rider, Paul R., Truncatep BINOMIAL AND NEGATIVE BINOMIAL 
DistrisvutTions, Vol. 50, No. 271 (September 1955), 877-883. 
On page 879, line 1, the exponent —k should be —m. 


Stoller, David S., UNIvARIATE Two-PoPpuLATION DISTRIBUTION-FREE 
DISCRIMINATION, Vol. 49, No. 268 (December 1954) 770-777. 

On page 772, line 10 should read Sy_,,%(z)=h/(N—m) instead of 
Sym (z) =h/N —m. 

On page 775, line 7 should end in <. Correspondingly, in line 11, the 
words “equals about” should, in both occurrences, be changed to “is 
less than.” 

It should be remarked that, as a consequence, the bias in Q (¢*) for 
this example is not so large as originally stated. 

The reasoning leading to the inequality is that 


Pr {| Q(¢*) — Q()| > 8} 
= Pr { | max (k/m — h/m)| > 28} 


< Pr {max| k/m — h/m}| > 28}, 
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BOOK REVIEWS 


Evaluation of 1954 Field Trial of Poliomyelitis Vaccine: Summary Report. Polio- 
myelitis Vaccine Evaluation Center. University of Michigan, Ann Arbor, Michi- 
gan, 1955. Pp. xiv, 51. No price. See review article on pages 1005-1013. 


Exercises in Theoretical Statistics. M.G. Kendall. New York: Hafner Publishing 
Co., 1954. Pp. vii, 179. $3.75. 


Herman Cuernorr, Stanford University 


oop examples in theoretical statistics are not easy to find. The author 

has compiled this list of 400 problems of varying grades of difficulty and 
suggested solutions to help in his teaching of statistics. (Many of the prob- 
lems were obtained from examinations at several English universities.) This 
reviewer feels that the need for a source of problems was so great that the 
usefulness of these problems far outweighs any shortcomings the book may 
have. In fact, Iam so enthusiastic about the appearance of such a book that 
I find it hard to say a harsh word about it and even feel that the errors in it 
have a useful function. 

It may develop that after several years of use defects will be discovered 
which will arouse criticism and that this book will be followed by others as 
good or better. In the meantime Kendall should be applauded for having 
satisfied a very definite need. 

One finds that in this book there are not many problems of general theo- 
retical interest in inference. Most of the problems of a general nature deal 
with distribution theory and most of the problems in inference involve special 
examples. In a book of only 400 problems, however, it is debatable whether 
this can be considered a defect. In any case, the special examples deal with 
a rather wide variety of topics of interest among statisticians. 

The essential prerequisite for a student working on these problems is a 
year’s course in mathematical statistics, with a calculus prerequisite. It is 
also desirable that the student be acquainted with advanced calculus and 
matrix theory, and have more than one year’s experience in statistics. These 
problems are very well suited for graduate students in American universities 
who often spend years studying advanced theory without ever hearing of 
cumulants, partial regression coefficients, or the method of moments, and 
tend to avoid any contact with specific applications. 

The problems have been arranged in five main groups: Distribution Theory 
(100 problems), Sampling Theory (100 problems), Relationships (75 prob- 
lems), Estimation and Inference (75 problems), and Time Series (50 prob- 
lems). 

In the first group, most of the problems involve moments and character- 
istic functions of specific distributions or general properties of moments. Sev- 
eral queueing and traffic problems are also treated. One of the problems yields 
an illustration of the fact that two characteristic functions can coincide over 
an interval without being identical. 
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In the group on sampling, the problems are essentially similar except that 
they involve distributions of statistics based on samples of size larger than 
one. The third group is quite similar also except that the emphasis mainly is 
on correlation and slightly on contingency. 

The group on estimation end inference mainly consists of interesting prob- 
lems in various fields where statistical theory has been applied. Readers who 
are not theoretical statisticians will probably be most interested in reading 
some of these problems. 

The final group involves time series. It deals with moving averages, serial 
correlations, periodograms, and spectra. I would have preferred to see more 
examples involving runs and birth and death processes. 

In reviewing the book, I did some of the problems both because of their 
interest and to check for possible errors. The results must necessarily be bi- 
ased because as the pressure of time rose, the density of problems done had 
to decline sharply. Hence the first group was by far the most thoroughly 
checked. The proposed solution of problem 24 involves a rather interesting 
inequality which is not always correct. The results claimed in problems 4 
and 58 are incorrect. In a few problems the questions could have been posed 
more precisely or conditions were implied but not stated. These are of minor 
importance, though. For example, in (25) where it is stated that a function 
has a single maximum, presumably a single relative maximum is meant. In 
(28), r>1 is implied. In (79) the condition 1, j, k, 1 distinct is implied. In 
(113) the wording is poor and the expression for a variance is a random vari- 
able. In (144) = was omitted and in (214) u=azr+by is the correct definition. 

While the presence of errors in scientific papers is to be deplored, it is my 
impression that there are too many students who are perfectly capable of 
giving proofs for incorrect statements. Because of this I would have preferred 
to have some serious errors and vaguely stated problems purposely scattered 
throughout the book. The careful checking that the author and his friends 
and helpers did has been too effective for my taste. 

It is difficult to quarrel with the choice of problems in such a book. I be- 
lieve that the author was correct in not stressing combinatorial probability. 
I would prefer to have more problems dealing with asymptotic resuts, general 
results in inference and some elementary problems in decision theory. On 
the other hand there undoubtedly were substantial reasons for not inserting 
such problems. 

I hope that this fine work will soon be supplemented by others, for the 
need for good problems is great among developing theoretical statisticians. 


The Methods of Statistics, Fourth Edition. L. H. C. Tippett. New York: John 
Wiley and Sons, Inc., 1952. Pp. 395. $6.00. 


CHURCHILL E1senHART, National Bureau of Standards 


™ First Edition of this book, published in 1931, filled a then urgent 
need for a presentation of the methods of both the Pearsonian and 
Fisherian schools in comparatively simple terms. The First Edition was an 
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immediate success. The present reviewer is one of many who today can 
state that it was from reading and rereading Tippett’s 1931 book that he 
first gained a real understanding and full appreciation of the New Look in 
Statistics, which began with the appearance of R. A. Fisher’s Statistica] 
Methods for Research Workers. 

This Fourth Edition, like the First, is an important book. Chapters 
3-12 have to do primarily with statistical inference. They constitute the 
bulk and the most important part of the book. Here are treated the usual 
elementary tests of significance and confidence-limit procedures for samples 
from normal, binomial, and Poisson populations. Analysis of variance, 
linear and nonlinear regression, simple, multiple, and partial correlation 
are discussed quite fully. Sampling and the principles of experimental ar- 
rangements are touched upon more briefly. Other topics considered briefly, 
yet instructively, include: combination of probabilities from independent 
tests of significance, tests for association in rank data, analysis of co-vari- 
ance, orthogonal polynomials, analysis of time series, and the use of trans- 
formations to simplify statistical analyses. A fine feature of these chapters 
is the clear and well organized style of the writing. Some of the more com- 
plicated concepts are discussed in such a thorough and simple manner that 
the concepts themselves appear to be almost obvious. There is a general 
appeal to intuition and an abundant use of pictures, which should be espe- 
cially helpful to the beginner. Illustrative examples, based on actual data 
from biological or industrial sources, are numerous, realistic, and instruc- 
tive. 

From a careful study of the Introduction and Chapters 3-12, a non- 
statistician will become acquainted with ideas and techniques that are 
new to him, will begin to see statistics as a general method of research that 
can be applied to good advantage in his own field, and will come away with a 
working knowledge of particular methods of statistics. He will see just 
exactly what these particular methods aim to accomplish, what they can 
and cannot do, and he will begin to have some ideas on where, when, how, 
and to what immediate ends they might be employed to advantage in his 
own work or field. 

The first two chapters serve to introduce frequency distributions, aver- 
ages, measures of dispersion, moments, mathematical probability, and the 
binomial, Poisson, normal, and chi-square distributions. The exposition here 
is not, in the opinion of this reviewer, up to the high standard of quality 
maintained throughout the rest of the book. Thus, the author made no 
attempt to distinquish clearly between a population parameter and a sample 
function that serves as an estimator of the parameter until he is forced to do 
so, in a footnote, by the requirements of a mathematical argument on page 
83. He never clearly distinquishes between an estimator and an estimate. On 
the other hand, the discussion of randomness is exceptionally well done; 
and the present reviewer was particularly delighted to see the explicit dis- 
cussion (on pp. 60-61, 67-69) of the relation of the Poisson distribution 
to the exponential distribution. 
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In summary, this is a very readable general introduction to the methods 
of statisties. A scientist or engineer without formal training in statistical 
theory and methodology will find here nearly everything he is likely to need 
in the performance of his regular duties. The mathematics required is just 
about what he can use with ease, although he may have been exposed to 
more. The auxiliary tables and charts included in the Appendix are suffi- 
cient for all ordinary applications of the methods and procedures discussed. 
Tippett writes with the authority of one who has personally applied statis- 
tical methods with telling effect in a variety of fields, and with the skill of 
one from whom many others have over the years learned to do likewise. A 
scientist or engineer who merely peruses a chapter or two will certainly 
become statistically minded, and if he reads the entire book he will be con- 
siderably enriched. Professional statisticians, and others with formal train- 
ing in statistical theory and methodology, will gain valuable information 
from a perusal of this volume on how to present statistical concepts and 
techniques instructively and persuasively to practical men in other fields. 


Elementary Statistics. John M. Howell and Ben K. Gold. Dubuque, Iowa: Wm. C. 
Brown Company, 1954. Pp. v, 154. $3.00. Paper. 


Frep C. AnprEews, The University of Nebraska 


wo to the authors, this book is an attempt at satisfying a need for 


an elementary statistics textbook that would (1) “emphasize the fun- 
damental principles and techniques which are applicable to all fields where- 
in modern statistics is useful,” (2) “be intelligible to the student who has 
had no more mathematics than ordinary first year algebra,” and (3) “Its 
length should not be too imposing.” Taking these requirements in reverse 
order we see that the book has 142 pages of text divided into 9 chapters and. 
should certainly be about the amount of material for a one semester course. 
The mathematical level of the book is that of first year algebra. Of course 
no mathematical proofs or derivations are presented. There is a chapter 
giving the necessary arithmetic, which would be very helpful. For the most 
part, facility with formulas is developed as the book proceeds. The topics 
considered are those of rather wide application. After chapters of introduc- 
tion and arithmetic, the table of contents reads “Measurements—Quantita- 
tive Data”; “Attributes—Qualitative Data”; “The Normal Distribution” ; 
“Estimation, Confidence Limits, and Significance Tests for Large Sample 
Sizes”; “The Chi-square Distribution”; “Correlation”; “Descriptive Sta- 
tistics”. 

The authors stress heavily the computational techniques of the topics 
considered. In each chapter there are ample exercises and a, review exam- 
ination complete with answers. The ayplication of a formula is generally 
demonstrated by the use of real data. It is disappointing to find that the 
problems are not adequately connected by a flowing discussion. The brevity 
of text is the main fault of the book. The emphasis on computation and mini- 
mum of explanation gives the book the appearance of a statistical recipe 
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collection rather than an introduction to statistical reasoning. The text 
resembles a set of classroom notes intended for review use rather than ma- 
terial intended for initial learning. I believe that it would be difficult to 
learn more than formulas by independent reading of this book alone. 

This volume can be recommended as a reasonable problem source. Its 
spiral binding and flexible cover make it easy to use. Also its modest price 
is attractive. I do believe; however, that if principles of statistical reasoning 
are to be presented along with computational procedures, from this book, 
it will be necessary to devise a rather complete set of accompanying lec- 
tures. 


Elements of Statistics. H. C. Fryer. New York: John Wiley and Sons. 1954. 
Pp. viii, 262. $4.75. 


AnpDRE G. LAuRENT, Michigan State University 


| yoummed book, which is intended for students whose mathematical back- 
ground does not go beyond college algebra, is one of the best—if not 
the best—introductory text in elementary statistics the reviewer knows of. 

The subject matter has been restricted to a limited number of topics: 
(1) Population and Frequency Distributions (Chapters II and IV); (2) Ele- 
mentary Probability (Chapter ITI); (3) Sampling from Binomial and Nor- 
mal Populations (Chapters V and VI); (4) Linear Regression and Cor- 
relation (Chapter VII). The material covered is well chosen, thoroughly 
discussed and vividly illustrated by a broad variety of worked examples. 

The author is careful to introduce progressively the fundamental concepts 
and the underlying principles of statistical method. Every effort is made to 
make them clearly understood. Though the student is provided with the 
routine methods fundamental to all the applications and even a few recent 
techniques (the G test is presented) the emphasis is on the basic ideas rather 
than on the technical aspects of statistics. 

The treatment is well balanced, and avoids the pitfalls of overemphasis 
on one systematic method of approach. The book succeeds in not sacrificing 
too much rigorousness to simplicity. A few examples will show that only 
minor criticisms can be made. Wald’s name is not mentioned in the Section 
devoted to history; the use of the word “normal” on pages 16 and 17 is 
somewhat ambiguous; on page 25, frequency polygons are used rather than 
histograms to represent distributions; on page 68, the assumption of inde- 
pendence in Bernoulli’s trials is not stressed; an allusion to truncated dis- 
tribution on page 108 could be easily misunderstood. 

The style is direct and lively, the examples look realistic; these features 
prevent the text from being dull. Though the number of statistical publi- 
cations is increasing each year, there is a critical shortage of books of sim- 
ilar quality on that level. This new addition to statistical literature will be 
especially welcome by elementary statistics teachers; it maintains the high 
standard of the series. 
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An Introduction to Statistics for the Social Sciences. T. G. Connolly and W. 
Sluckin. New York: Hafner Publishing Co., 1953. Pp. vii, 154. $2.75. 


Howarp Rarrra, Columbia University 


HE goals of the authors are clearly stated on the jacket cover of this book: 
Ts filling a gap in the elementary literature of statistics this modest and 
useful book brings help to a wide variety of students whose studies demand 
some statistical insight, yet who have neither the equipment nor the incli- 
nation to pursue the subject deeply.” The book is written for those students 
who have no knowledge of mathematics beyond simple arithmetic; however, 
some very elementary algebraic manipulations are occasionally employed 
and formulas, in general, are stated in symbolic fashion together with 
supporting numerical illustrations. Undoubtedly, the strong points in favor 
of the book are its brevity (154 small pages), its simplicity of language, its 
problem oriented motivation. Simple illustrative examples are chosen from 
the social sciences. The book is well written from a literary standpoint, very 
carefully exposited, but from a content and technical point of view it falls 
far short of its desired goal. No exercises are included in the book. 

The first 64 pages of the book deal with the elements of descriptive statis- 
tics and topics covered are: raw data, grouping into class intervals, frequency 
polygons and histograms, measures of central tendency and variability, 
empirical cumulative distributions. These concepts are amply illustrated 
and precise algorithms for computation are given. The authors attempt to 
tell the student how to select among various population indexes (e.g., mean, 
mode, median) but never discuss why a choice is necessary. Throughout 
the book there is a generous sprinkling of statements which are incorrect 
when taken literally (in the sense that one can give counter-examples to the 
propositions asserted) but which, in most cases, constitute fairly sound 
advice. For example, the authors assert: 


“Use the arithmetic mean when: 

a) the greatest reliability is required, 

b) the distribution is reasonably symmetrical, 

c) when subsequent statistical calculations are to be made.” 


Reliability at this point of the book is meant in the sense of “fluctuation of 
the measurement from sample to sample of the total population.” Obvious- 
ly, this advice is not sound if absolute deviations from the measure of central 
tendency is a pertinent index of “fluctuation,” or if the population is of the 
Cauchy type. 

In Chapter V (pages 64 to 77) the normal distribution curve is arrived at 
from an intuitive limiting analysis of Bernoulli Trials (coin-tossing) where 
the probability of heads is 4. After this demonstration the authors state: 

“The reader will have noticed that the assumption underlying the game 
of coin-tossing is that the chances of throwing heads or tails are equal. In 


other games of chance different conditions obtain. For instance, in dice- 
throwing the chance of a six or of any other of the six members, is one in 
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six (1:6). We could have approached asymmetrical distributions along these 
lines. There is no need, however, for our present purpose to consider skew 
distributions in this manner. 

“It used to be thought at one time (a) that many distributions found jn 
nature were normal, and (b) that there were special reasons why they should 
be normal. Jt is not now thought that the second statement is true. As regards 
the first statement, all that can be said is that many natural distributions 
approximate the normal form more or less closely.” (Italics mine.) 


As is well-known, the underlined statements above are incorrect, for the 
Central Limit Theorem furnishes a very basic special reason why many 
distributions are normal and this same theorem asserts that no asymmetri- 
cal distribution could have been approached along the lines chosen by the 
authors (indeed, the limiting distribution is normal). In a book of this size 
it is reasonable that the Central Limit Theorem is not discussed but it is 
not reasonable that authors of a statistics text are not familiar with its 
implications. 

The latter half of the book deals with selected elementary topics of statis- 
tical inference. The authors’ viewpoints on statistical inference predate the 
Neyman-Pearson school for no mention is made of two types of errors, 
power of a test, etc. The basic ingredients of the inference problem are 
never clearly stated and this leads the authors to repeatedly fall into such 
elementary traps and propagate such misconceptions as exemplified in the 
quote below on confidence statements: 

“ ... Therefore at the one per cent level of confidence our limits are given 
by 7.61 +3.50 X2.04 or 7.61 +7.11 (approx.). This means that if we took a 
very large number of sets of readings, the chances are 99 in 100 that the true 


standard deviation (representing the measure of scatter of the readings) 
could be anywhere between 0.50 lb. and 14.72 lb.” 


Of course, what is meant is: that if repeated sets of readings are taken, and 
if for each set an interval is associated in the manner so carefully explained by 
the authors, then, almost certainly, the proportion of intervals containing the 
true standard deviation will approach .99; and, consequently, for the given 
reading, we have “confidence” .99 in the assertion that 7.61+7.11 covers 
the true standard deviation. 

In all tests of hypotheses, statements are made at the 1% and 5% levels. 
If experimental results are significant at the 5% and not at the 1% level, 
the authors conclude essentially that the issues are not clear-cut and that 
further experimentation is necessary. They do not say how large the next 
sample should be and how the total set of observations should be treated. 
Furthermore, they do not seem to be aware that the way the null hypotheses 
are formulated one can almost guarantee a significant difference at level 
1% provided the sample sizes are sufficiently large. 

The following topics are discussed in the inference section of the book: 

“Reliability” of estimated mean and standard deviation in small and large 


samples, significance of difference between means, contingency tables and 
x? tests of independence, x* test of agreement of empirical c.d.f. with normal 
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distribution, scatter diagrams and the product moment correlation co- 
efficient, reliability of correlation coefficients, biserial and tetrachoric corre- 
lation, Kendall’s and Spearman’s rank correlation coefficients. 


Mr. Connolly is an extra-mural lecturer in social psychology, University 
of London; Mr. Sluckin is in the Department of Psychology, University of 
Durham. 


Handbook of Graphic Presentation. Calvin F. Schmid. New York: The Ronald 
Press Company, 1954. Pp. vii, 316. $6.00. 


Kennetu W. Haemer, American Telephone and Telegraph Company 


— have been numerous attempts to bring together the published 
standards of statistical graphic presentation, and to fill in the empty 
spots (for which no “official” standards exist) with a consensus of the best 
current practices. This book is the most successful effort of this kind so far. 


' The author was confronted with the serious obstacle that two of his major 


sources are somewhat out of date. (Time Series Charts—the American 
Standard—was written in 1938; and Standards of Presentation issued by the 
Army, was written in 1946.) Despite this, he has managed to do a highly 
creditable job of producing a reliable and helpful handbook. 

Audience: In his preface, the author states that “This book has been 
written as a working manual for all who are concerned with the clear pres- 
entation and interpretation of statistics in grafic form.” Later, he suggests 
that it is intended primarily for students and beginning statisticians. These 
slightly contradictory aims are met by including some chapters of broad 
general interest, some of interest mainly to beginners, and some that will 
appeal to specialists. 

Content: This is a reasonably thorough book, although like most books on 
this subject, it gives the most attention to the author’s favorite kinds of 
chart and answers most fully the questions met in his major fields of interest. 
In this case the charts explained in most detail are semi-log charts, frequency 
charts, and three dimensional charts, each of which receives a chapter. Maps, 
another of the author’s favorite presentation forms, are also enthusiastically 
described at considerable length in a fourth chapter. 

The seven other chapters discuss the following topics: principles of chart 
design, drafting techniques, curve and surface charts, bar and column charts, 
miscellaneous graphic forms, pictorial charts, and chart reproduction. 

Most of the material in this book is well organized and presented. The au- 
thor has done the reader a disservice, however, by titling the chapter 
about curve and surface charts “Rectilinear coordinate charts” and then neg- 
lecting to explain that several of the other chapters are also about charts 
based on the same scheme of two scales crossing at right angles. 

Bar and column charts are discussed as though they were horizontal and 
vertical versions of the same thing. In fact, the author says “Basically they 
are identical.” True, they are similar in appearance, but basically bars are 
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one-scale comparisens; columns use the two-scale rectilinear coordinate 
system. 

The chapter on semi-logarithmic charts explains the mechanics of this 
type clearly but goes overboard on the virtues and importance of relative. 
rate-of-change presentation. Overenthusiastic praise such as this: “In 
comparison with the arithmetic line chart it possesses most of the advan- 
tages without the disadvantages” is likely to start beginners and students on 
the wrong foot. This would have been a better chapter if the author had 
explained clearly that neither of these charts has any inherent advantages or 
disadvantages; that each is intended to serve a different purpose, and that 
the disadvantage lies in trying to use one to do the work of the other. 

The chapters on frequency charts and statistical maps are generally 
excellent for the purposes of this book. Both are developed much more fully 
than in any similar book. Relationship charts, however, would benefit from 
a fuller treatment. The discussion of pictorial charts is extremely good. The 
chapter about three dimensional charts is exceptionally good, the most 
thorough treatment of this subject that this reviewer has seen. 

The final chapter outlines the major reproduction processes and empha- 
sizes the importance of preparing charts with a specific method of repro- 
duction in mind. This is a useful chapter; but it would be even more helpful 
if the space devoted to the history of reproduction processes had been given 
instead to further explanation of the uses and limitations of each process. 

Illustrations: In any book on graphic presentation, the illustrations are 
the proof of the author’s understanding of the subject. Professor Schmid’s 
are excellent. Except for a few exhibits borrowed from other sources, they 
are first rate examples of how charts and maps should be drawn and pre- 
sented. The number of illustrations is ample: in addition to many complete 
charts and maps, there are a large number of stripped-down “thumbnail” 
examples showing the major features of different chart types. 

Reliability: It is probably inevitable that every book will contain incon- 
sistencies. This book contains some; but most of them are minor. For 
example, the author warns against the omission of zero lines yet uses sev- 
eral illustrations that have none; he states that zero lines should be made 
heavier than other rulings but neglects to do this consistently. 

Most of the technical details in this book are sound and sensible. There 
are very few things in it that statisticians could quibble about. But there are 
some; typical examples: (double amount scales) “If zero values are omitted, 
the scales should be so adjusted that the zero lines would coincide if the 
scales were extended to zero.” This statement rules out many of the most 
useful double-scale charts: those in which the scales have been adjusted to 
bring out the similarity in the pattern of two series without regard for the 
magnitude of their changes. 

Style: The writing is generally excellent. On those topics where the author 
lets himself go, he is admirably clear and specific. Nevertheless, he has one 
small but persistent weakness that will disturb some readers: the habit of 
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stirring up interest in a topic, then dismissing it with a vague generality. 
Typical examples; page 162 (Frequency charts) “Logarithmic rulings may be 
used for both axes, or for either the X or Y axis, depending on the problem 
or purpose at hand”; page 79 (Bar charts) “Also, in a well designed chart of 
this kind the bars should not be disproportionately long and narrow or short 
and wide”; page 181 (Organization charts) “The various names, titles, and 
descriptive labels are generally enclosed in squares, circles, or other shapes, 
which are connected by various kinds of lines.” However, despite the 
presence of unresolved ideas, this book is well written and is easy to read 
and to understand. 

Summary: The chief value of this book is that it brings together the 
major standards of statistical graphic presentation, and supplements these 
with information about special forms and problems not covered in the offi- 
cial “standards.” It puts under one cover a valuable collection of graphic 
principles and procedures that the reader would otherwise have to search 
for in several volumes. Except when it leans too heavily on out-of-date 
sources, its recommendations follow the best current practices. It should be 
a valuable introduction for the beginner in graphic presentation, and a 
useful reference for the experienced chart maker. 


Limit Distributions For Sums of Independent Random Variables. B. V. Gnedenko 
and A. N. Kolmogorov. Translated from the Russian and annotated by K. L. 
Chung. With an Appendix by J. L. Doob. Cambridge, Mass.: Addison-Wesley 
Publishing Co., 1954. Pp. ix, 264. $7.50. 


WassiLy Hoerrpine, University of North Carolina 


HIs is a systematic treatment of the limiting forms and the asymptotic 

behavior of the distributions of sums of independent real-valued random 
variables as the number of summands tends to infinity. The work of recent 
decades, in which both authors played a prominent part, has revealed that 
the classical results such as the law of large numbers, the central limit 
theorem, and Poisson’s limit for the binomial distribution are merely special 
cases of general limit theorems which give the final answers to a substantial 
class of problems that have arisen in this field. The book presents the result- 
ing beautiful theory in a clear and simple form. The only other book 
in English which contains a similarly extensive, though less complete, 
treatment of this subject is Loéve’s recent Probability Theory. The translator 
K. L. Chung, has added explanatory notes, made a number of corrections, 
and incorporated some improvements from the Hungsrian edition. The 
reader is expected to have a knowledge of calculus to the extent needed in 
operating with characteristic functions which are the principal tool in the 
proofs. 

The book consists of three parts. The introductory Part I begins with the 
mathematical foundations of probability, based on the theory of measure, 
and goes in this respect somewhat beyond what is essential for the under- 
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standing of the main part of the book. Some basic questions raised in this 
first chapter are elaborated in more detail in an appendix by J. L. Doob. A 
discussion of distributions and characteristic functions is followed by a chap- 
ter ou infinitely divisible distributions, which are of fundamental importance 
for the subject of this book. A random variable is said to be infinitely divis- 
ible if for every natural number n it can be represented as the sum of n 
independent, identically distributed random variables. An infinitely divis- 
ible distribution is the distribution of an infinitely divisible random variable. 
Thus the normal, the Poisson, the Cauchy, and the Gamma distributions 
are infinitely divisible. So is the improper distribution which assigns prob- 
ability one to a single point. Paul Lévy’s representation of the character- 
istic function of an infinitely divisible distribution puts in evidence the wide 
extent of this class of distributions. 

In Part II the following general problem is considered. Let S, = Xni+X,: 
+---+X,., be a sum of mutually independent random variables whose 
number k, tends to infinity with n. Under what conditions do there exist 
constants A, such that the distribution of S,—A, tends to a limit distribu- 
tion, and what is the form of the latter? Familiar special cases are the law of 
large numbers (one form of which states that if k,=n and X,.=X;/n, 
where Xi, Xo2,°--, Xn,°°** are independent and identically distributed 
with mean y, then S, tends to u in probability and hence has an improper 
limit distribution), the central limit theorem, and Poisson’s limit theorem 
for “rare events.” In the general case it is assumed that the summands X,, 
are (asymptotically) infinitesimal, which means, roughly speaking, that 
each summand becomes negligibly small with high probability as n increases. 
We then have the theorem (due to Khintchine) that any infinitely divis- 
ible distribution, and no other, can be the limit distribution of S,—An,. 
Further theorems show how the constants A, and the form of the limit 
distribution (if it exists) can be determined from the distributions of the 
summands X,,. By imposing additional restrictions, necessary and sufficient 
conditions are obtained for the limit distribution to be of a particular type. 
It is important to note that the conditions for the convergence to a normal 
distribution are of a very general kind whereas for the convergence to any 
particular proper non-normal distribution much more special conditions 
are required. Thus the prominent role of the normal law as a limit distri- 
bution is not merely a historical accident. 

The last chapter of Part II is concerned with the special case where 
S,=Xit+ +--+ +X, is the sum of the first n terms of an infinite sequence of 
independent random variables. Here the problem is to find, if possible, 
constants B, >0 and A, such that (S,/B,) —A, has a limit distribution. It 
is assumed that the random variables X:/B,, -- +, X,/B, are infinitesimal. 
The class LZ of possible limit distributions under these conditions, which 
was determined by Paul Lévy, turns out to be a proper subclass of the class 
of infinitely divisible distributions. The Poisson distribution does not belong 
to this class. 

In the final Part III the further assumption is made that S, is the sum of 
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the first n terms of a sequence of independent and identically distributed 
random variables. The possible limit distributions of (S,/B,)—A, are the 
so-called stable distributions, which were investigated by Khintchine and 
Paul Lévy. The class of stable distributions is a small subclass of the class L. 
The only proper stable distributions with finite variance are normal. The 
Cauchy distribution also is stable. If the class of limit distributions in the 
proper sense is thus severely restricted under the present ass . options, any 
infinitely divisible distribution may be the limit distribution of a subsequence 
of (Sp/Bn) — An, a8 is shown in a theorem of Khintchine. 

Limit theorems are of interest in applications mainly insofar as the limit 
distribution may serve as an approximation of the distribution of a finite 
sum. The greater part of Chapter 8 is concerned with the rapidity of the 
approach of the distribution of S, to the normal distribution. Only the case 
of identically distributed summands is considered. Some of the bounds of 
Berry and Esseen for the difference between the distribution function of 
S, and the normal distribution function are derived. Asymptotic expansions 
of the type considered by Edgeworth are presented in the rigorous form 
due to Cramér and Esseen. The rest of Chapter 8 and the final Chapter 9 
are concerned with local limit theorems, that is, with the approach of the 
probability density or the discrete frequency function of the normalized 
sum to the probability density of the limit distribution. 

The statistician or physicist may wonder whether limit distributions of 
sums of independent random variables which are not of the normal, the 
Poisson, or the improper types are of any importance in applications. Some 
indications of “unusual” limit distributions which occur in applied work 
can be found in the preface to this book. The authors express the opinion 
that the theory of infinitely divisible distributions and the general limit 
theorems connected with them will receive in time diverse applications. If 
such applications so far are relatively scarce, this may perhaps be due to 
the fact that those working in applied fields are usually not aware of the 
existence of these general limit theorems. 


Psychological Statistics. Second Edition. Quinn McNemar. New York: John 
Wiley and Sons, Inc., 1955. Pp. viii, 408. $6.00. 


Leo Katz, Michigan State University 


™ second edition of McNemar’s book is about as good as the first (see 
review, this Journal, vol. 44, pp. 572-574). Some minor errors have been 
corrected, some of the omissions in the original version have been partially 
filled (or,at least, mentioned) and the order of presentation has been some- 
what altered. The first edition, at $4.50, was a better buy. 

About 95 per cent of the revision has been done with shears and pastepot. 
The first three chapters are unchanged except for (a) the insertion of a 
paragraph, attempting to define statistics and parameters, in the first 
chapter, (b) the insertion of five paragraphs, on the relation between fre- 
quency polygon and histogram, in the second chapter, and (c) a minor 
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revision of the first paragraph of the third chapter. The old chapters 4, the 
normal curve and probability, 5, sampling errors and statistical inference 
and 12, small sample methods (which previously followed correlation and 
contingency) are rearranged and served up with pedagogical advantage in a 
unit of four chapters headed distribution curves, probability and hypothesis 
testing, inference: continuous variables, and small sample of ¢ technique. 

As in the first edition, the naive reader is almost certain to form a set of 
incorrect ideas concerning inference about distribution means. He is likely to 
feel that the assumption of normality of distribution is about on the same 
level as the use of a sharp pencil—nice, but not exactly necessary. He will 
probably be convinced that the question of size of sample is all-important 
and, consequently, will abhor medium-sized samples. Dealing with “small” 
samples, he will use t whether he knows the variance or not. 

The remainder of the book, consisting of five chapters on correlation, 
one on chi-square, three on analysis of variance and one of notes on sampling 
is slightly improved by the addition of three secticns of twenty pages on 
models for the analysis of variance, a chapter of six pages on comparison of 
variabilities and a chapter of (only) four pages on distribution-free methods, 
Two insertions, of less than a page each, on discriminant functions and 
intraclass correlation, might better have been replaced by references to more 
adequate discussions. Two pages, added to the chapter on chi-square, dis- 
cuss exact probability computations for 2X1 and 2X2 tables but fail to 
mention the existence of tables giving these probabilities. 

The wide acceptance of the first edition and the probable duplication of 
that success in this second edition raise certain questions. This reviewer 
feels strongly that the practitioner of statistical inference must understand 
much more of his art than he brings to bear on a specific problem; there- 
fore, the “cookbook” approach cannot succeed. However, many disagree 
with this point of view. For these pragmatically-minded individuals, the 
measure of the cookbook is the quality of the cake produced. This kind of 
evaluation of books like McNemar’s seems impossible to achieve. In the 
absence of such evaluation, the reviewer can only assert his judgment that 
McNemar’s book is as good as any written in the past two decades for 
the purpose of putting the sharpest tools of the statistical inference trade 
in the untrained hands of mature, but mathematically naive, students of 
the social sciences. 


Factorial Analysis for Non-Mathematicians. C. J. Adock. Melbourne, Australia: 
Melbourne University Press, 1954. Pp. 88. 17s.6d. 


Wituiam G. Mapow, Stanford University 


} brief text is intended to cover “the essentials of factor analysis in 
simple non-mathematical terms. There seems to be a need for a book 
which will enable the student to get a quick insight into what is involved 
without laboring through any mathematical equations.” 
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The titles of the nine chapters provide a good outline of the contents. 
They are: 1. The logic of factor analysis, 2. Correlation, 3. Factors, 4. The 
Principle of Rotation, 5. Finding simple structure, 6. Cluster-directed 
analysis, 7. Combined centroid and group analysis, 8. Interpreting the 
results, 9. The role of factor analysis. In addition, there is an appendix and 
a brief list for further reading. 

The book is clearly written and has carefully selected illustrations. With 
supplementation presenting methods of factor analysis besides the centroid 
method, it could serve well as an introductory text in computing factors 
were it not for the absence of problems. I would think that a book stressing 
computing so much as this would contain problems whose answers appear- 
ed at the end of the book—particularly when the procedures involve as much 
personal decision as does rotation. 

It is unfortunate that, probably because of its brevity, the book contains 
so little discussion of the underlying logic of factor analysis. Factor analysis 
attempts to determine the underlying structure of abilities or other charac- 
teristics which have the property of being latent either because they are not 
measurable directly or because the researcher chooses not to measure them 
directly. Spearman, in his beginning work, made a certain hypothesis con- 
cerning the relationship of the latent and manifest variables and this hypoth- 
esis was later generalized in the work of those who followed. These 
hypotheses and their implications could be discussed with no more mathe- 
matics than the book already has. 

In order to make any estimate of the latent variables, it is necessary in 
some way to connect them with the manifest variables. The usual way in 
which this is done is to assume that the manifest variables are linear com- 
binations of the latent variables (an error of measurement variable is usu- 
ally included). At this point, it turns out that these simple hypotheses just 
are not definite enough to permit the estimation of the factors end conse- 
quently it becomes necessary to make further assumptions that enable one 
to make these estimates. Such assumptions have been made by Thurstone, 
Hotelling, Thomson, Burt, and others. Again, some discussion of the differ- 
ences among these methods might have been given. 

Whatever assumptions are made one can, if he wishes, rotate, but this 
again involves still further assumptions. These assumptions might aiso 
have been discussed in non-mathematical terms. 

As is well known, there are considerable errors of measurements in any 
psychological test. Consequently, when one is factoring, the question occurs 
how many factors one should take. Much work has been done of recent years, 
in attempting to get at least a preliminary answer to this question but this 
work is not referred to in the current volume. But the questions must be 
answered practically in one way or another in each factor analysis. 

I hope that with the development of high-speed computing machines, 
the very computational aspects to which this book is devoted, will become 
unimportant. There was a time when students working on their doctor’s 
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degrees in psychology in an area in which factor analysis was the method 
used, spent much more time computing the factors than or the psychological 
aspects of their theses. Today, one can derive a full factor analysis in a rather 
brief time on various electronic computers by using procedures other than 
the centroid procedure. Thus, the oft-repeated justification for the cen- 
troid procedure namely, its relative simplicity in computation, is ceasing to 
exist and other methods may well take over. In a sense, this is as it should 
be. It is exceedingly unfortunate to have difficulties of computation deter- 
mining what is computed. 

Some particular comments follow. At several points on pp. 8, 9, and 30 the 
author sets up straw men and vigorously demolishes them. I think that wheth- 
er or not one holds these opinions, it is well to give the beginner in factor 
analysis an opportunity to form his own judgment on the uses of mathe- 
matics and not to indoctrinate the student under the pretext of defending 
him. As previously said in this review, I think the discussion of the logic of 
factor analysis is gravely inadequate from the point of view of the non- 
mathematician. A little on spurious correlation might have been in order 
here. 

It is incorrect to say, p. 30, that the first factor accounts for as much vari- 
ance as possible by one factor. Not centroid but principal components factor 
analysis has this property. 

I wish the author had discussed what one has at the end of a factor analy- 
sis. Often, psychologists seem to differ on whether factor analysis primarily 
organizes information in a way that the analyst finds of help in his psycho- 
logical thinking or whether the factor analysis itself finds underlying factors 
that have some intrinsic existence. Again, some discussion of this point would 
have been very interesting and helpful to all. 

Thus, in this reviewer’s judgment, the author did a rather nice piece of 
work on the computing procedure for factor analysis by the centroid methoa 
but did not discuss the logic or implications of factor analysis adequately. 


Decision Processes. R. M. Thrall, C. H. Coombs and R. L. Davis, Editors. New 
York: John Wiley and Sons, 1954. Pp. viii, 332. $5.00. 


Patrick Suprss, Stanford University 


HIS book is a collection of nineteen articles originating from the Univer- 

sity of Michigan Seminar on decision processes, held at Santa Monica, 
California in the summer of 1952. None of the articles is meant to be a con- 
tribution to statistics proper, but almost every one has some kind of relevance 
to the foundations of statistical decision theory. The foundations of decision 
theory seem naturally to divide into three parts: the theory of utility, the 
theory of subjective (or objective) probability, and the theory of optimal 
strategy. The material in this book contributes most to the theory of utility 
and least to the theory of subjective probability. A good portion of the ex- 











1955 


od 
ical 
ther 
han 
en- 
r to 
uld 
ter- 


the 
th- 
tor 
he- 


ing 


yn- 
ler 
ri- 
or 
y- 
ly 


O- 


ld 











BOOK REVIEWS 1849 


perimental results are concerned with the study of actual behavior which 
deviates from statistical or game-theoretic notions of optimality. 

The nineteen articles are of rather uneven merit. Some such as those of 
Milnor and Hausner, present definitive results, while certain ones are ex- 
tremely tentative, even sketchy in character and content. My general im- 
pression of this book reinforces my general impression of the theory of 
decision-making: the easy plums have all been plucked and a hard climb 
lies ahead for those in search of new fruit. Solid progress on the climb is re- 
ported in the volume under review, but there are no leaps and bounds ahead. 

A good general introduction to the variety of topics treated is provided 
by Davis, one of the editors, The second article is an expository discussion 
of mathematical models and measurement theory by C. H. Coombs, Howard 
Raiffa, and R. M. Thrall. Although the viewpoint represented by these 
authors is very clearly and ably stated here, I find it hard to agree with them 
that any arbitrary binary relation defines a scale of measurement. My basic 
objection is that the classical ordinal, interval and ratio scales are defined 
by the group of numerical transformations under which the measurement 
results are invariant, and this group-theoretic definition does not generalize 
in a natural way to an arbitrary relation or even to a partial ordering. 

The remaining seventeen articles are grouped in four parts. Part I is 
concerned with problems of individual and social choice. Article 3, by Leo 
A. Goodman, on methods of amalgamation, is concerned with the common 
elements in the theory of social choice given individual values and the theory 
of optimal individual behavior in situations involving uncertainty. Article 
4, by John Milnor, on games against nature is the most substantial contri- 
bution in this part. Milnor shows in an elegant manner that neither the prin- 
ciple of LaPlace, the minimax principle, Hurwicz’s generalization of minimax 
nor Savage’s principle of minimax regret satisfies a list of intuitively desir- 
able properties. He then characterizes each of these four principles of de- 
cision-making in terms of an appropriate subset of his list of desirable prop- 
erties. The article concludes with some suggestions concerning criteria 
which satisfy the most important of the properties. In Article 5 Roy Radner 
and Jacob Marschak consider a simple game of betting on the outcome of a 
toss of a coin with unknown bias after an odd number of tosses, in order to 
show that both Hurwicz’s generalization of the minimax principle and Sav- 
age’s principle of minimax regret can lead to counterintuitive solutions. In 
Article 6 Coombs discusses the problem of social choice from a psychologist’s 
standpoint and presents some tentative experimental results on the applica- 
tion of his scaling methods to the problem. In Article 7 Stefan Vail presents 
some tentative ideas on alternative calculi of subjective probabilities. 

Part II is concerned with learning theory. In Article 8, R. R. Bush, Fred- 
erick Mosteller, and G. L. Thompson present a concise mathematical sum- 
mary and some new results on the Bush-Mosteller stochastic models for 
learning. These models are similar in formal structure to the states-of-nature 
model of statistical decision theory. Some interesting trapping theorems on 
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asymptotic behavior are established. It is shown in an appendix that the 
authors’ restrictive but rather natural condition on the combining of classes 
of responses implies that the stochastic operators representing changes in 
probability of response from one learning trial to another are linear. In 
Article 9, W. K. Estes discusses individual behavior in uncertain situations. 
Estes emphasizes the significance of various experimental results which in- 
dicate that when people are asked to choose between two possible events 
(such as one of two lights flashing) their choice patterns asymptotically 
approach the relative frequencies of the events. Since the optimal strategy 
is always to choose the more frequent event, Estes is concerned to offer 
explanation of the actual behavior in terms of “environmental uncertainties,” 
such as the fact that subjects are not told whether the events occur randomly 
or systematically. In my opinion statistical decision theory naturally sug- 
gests two remarks about Estes’ analysis. First, the experiments involved 
no risk to the subject. If subjects won or lost money as the result of correct 
or incorrect choices would the same asymptotic behavior be exhibited? 
Second, there is no attempt to characterize optimal behavior with respect 
to the a priori information given the subjects. An answer to this admittedly 
rather complicated problem would also be of use in analyzing the first n 
responses of subjects (in contrast to their asymptotic behavior). In Article 
10, Merrill M. Flood presents a preliminary formulation of some ideas com- 
bining game theory and learning theory. The general orientation is very 
similar to that of the Bush-Mosteller learning theory. 

Part III is concerned with the theory of utility and its applications. In 
Article 11, Gerard Debreu elegantly establishes sufficient topological con- 
ditions for a preference ordering to be representable by a numerical function. 
(The lexicographic ordering of points of the plane provides an example of a 
complete ordering which is not so representable.) In Article 12, Melvin 
Hausner investigates the consequences of dropping the Archimedean post- 
ulate in the von Neumann and Morgenstern theory of utility. His main re- 
sult is a representation theorem in terms of vector spaces. In Article 13, 
Thrall extends the results of Hausner in the previous article by a furthering 
weakening of the axioms, and then discusses various possible applications of 
multidimensional utility theory in game theory. These applications hinge 
upon having a lexicographic ordering of the vectors representing the utili- 
ties; one tries to maximize the first component, then the second, etc. In 
Article 14, Jacob Marschak analyzes the activities of “teams” on the basis 
of game theory and statistical decision theory. A team is defined to be a 
group for which the group utility function is essentially the same as the in- 
dividual utility functions of the group members. Concepts of organization, 
communication and action are introduced to analyze team activities. The 
unwieldly apparatus needed for the study of optimal team behavior is to 
be expected in view of the many known complications of coalition behavior 
in n-person games. However, some simple examples are presented which 
argue well for the intuitive reasonableness of the general concepts defined. 
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In Article 15, Herbert G. Bohnert critically examines the logical structure 
of the utility concept. Although some of his remarks are well-taken, in 
choosing to analyze statements of the form “The utility of xz for entities 
1,2, -: +, min the amounts y1, yz, ° ++, Yn, is equal to u utiles,” he has, I 
think, gone amiss. Statements of this form do not represent direct observa- 
tions of individual z’s behavior; rather they are complicated inferences 
made by means of a necessarily elaborate theory of decision-making from 
observations of 2z’s actual choices. The general theory of decision-making 
rather than isolated numerical utility statements would seem to be a better 
focal point of criticism. 

Part IV explicitly deals with experimental studies of decision processes, 
although some of the previous articles also reported on experimental results 
(particularly Article 9). In Article 16, Paul Hoffman, Leon Festinger, and 
Douglas Lawrence present the results of a carefully designed experiment on 
the formation and dissolution of coalitions in differently motivated situations. 
The chief experimental variables were task importance and degree of com- 
parability between players of the game (peer versus non-peer conditions). 
Although their conceptual framework does not use the theory of n-person 
games in ar: “erious way, their experiment throws interesting light on some 
of the probleius of developing a theory adequate to deal with actual behavior 
in competitive situations. In brief their findings were that a player given 
an initial advaatage was irrationally (from a game-theoretic standpoint) 
denied opportunities to form coalitions. His opportunities were further re- 
duced by an increase in the importance attached to the game. This “irration- 
lity” of the other :layers was more manifest if the player with the advantage 
was rated as a peer rather than a non-peer. In Article 17, Coombs joins with 
David Beardsice to present a general model for experiments on decision- 
making under uncertainty. Since only some very tentative experimental 
results are reported and most of the pages are given over to a description. 
of the model, this article could perhaps as well have been included in Parts 
I or III, as Davis points out in his introductory article. However. their 
model is experimentally oriented. It is not described in a mathematically 
precise manner, and it is not clear that any new mathematical consequences 
could be derived if it were formulated sharply. In Article 18, Flood makes a 
second contribution dealing with a pilot experiment which was partly stimu- 
lated by the results of Estes reported in Article 9. Flood’s aim was to test 
the hypothesis that people’s strategies in a “prediction” experiment are 
“mixed” or “pure” depending on whether they believe the events being 
predicted are patterned or random. The experimental results obtained did not 
conclusively support the hypothesis, but they were significant enough to 
warrant further investigation. Flood tries to meet the kind of criticism con- 
cerning the question of optimality which was raised above in discussing 
Estes’ article, but he does not examine the problem of optimality for the 
patterned case in any detail. In Article 19, the final article, four mathemati- 
cians, G. Kalisch, J. W. Milnor, J. Nash and E. D. Nering, report on some 
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interesting experiments concerning various theoretical concepts in n-person 
game theory: strategic equivalence of games, von Neumann-Morgenstern 
solutions, the Shapely value, Milnor’s bounds for “reasonable” outcomes, 
Nash’s equilibrium points. Although the experimental results are somewhat 
tentative and too complicated to summarize here, they represent a worthy 
pioneering effort in a difficult field. 


Mathematics and Plausible Reasoning. Volume I, Induction and Analogy in 
Mathematics. Volume II, Patterns of Plausible Inference. G. Polya. Princeton: 
The Princeton University Press. London: Geoffrey Cumberlege, Oxford Uni- 
versity Press, 1954. Volume I, pp. xvi, 280. Volume IT, x, 190. $9.00; Volume I, 
$5.50; Volume II, $4.50. 


Leonarp J. SavaGe, University of Chicago 


= excellent book has a double interest for statisticians. In the first 
place, I never knew a statistician who thought he knew enough mathe- 
matics for his job, and this book, especially Volume I, is effectively designed 
to increase the mathematical power of a remarkably wide audience, an au- 
dience including everyone, no matter how advanced, who has studied college 
mathematics. In the second place, nondeductive reasoning is of paramount 
importance to the statistician, and this book is rich in examples and insights 
bearing on application of such reasoning to mathematics. These applications 
are unlike those to which textbook statistical theory gives access, but that, 
I think, should enhance their value to statisticians. No one is better qualified 
than Polya in interest, experience, or technical or expository skill to write on 
nondeductive reasoning in mathematics and its role in the development of 
mathematical power, as this book and others he has written bear witness. 

The scope and tenor of the book can be well conveyed by quoting certain 
passages (paragraphing mine): 


This book has various aims, closely connected with each other. In the first 
place, this book intends to serve students and teachers of mathematics in 
an important but usually neglected way. Yet in a sense the book is also a 
philosophical essay. It is also a continuation and requires a continuation. 
i, 

The standards of plausible reasoning are fluid, and there is no theory of 
such reasoning that could be compared to demonstrative logic in clarity or 
would command comparable consensus. [I, v] ... Another point concerning 
the two kinds of reasoning deserves our attention. Everyone knows that 
mathematics offers an excellent opportunity to learn demonstrative reason- 
ing, but I contend also that there is no subject in the usual curricula of the 
schools that affords a comparable opportunity to learn plausible reasoning. 

I address myself to all interested students of mathematics of all grades 
and I say: Certainly, let us learn proving, but also let us learn guessing. 
{I, v]... Mathematics is regarded as a demonstrative science, [I, vi] ... 
Yet mathematics in the making resembles any other human knowledge in 
the making. You have to guess a mathematical theorem before you prove 
it; you have to guess the idea of the proof before you carry through the 
details. You have to combine observations and follow analogies; you have 
toztry and try again. [I, vi]... 
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I do not believe that there is a foolproof method to learn guessing. At any 
rate, if there is such a method, I do not know it, and quite certainly I do not 
pretend to offer it on the following pages. The efficient use of plausible 
reasoning is a practical skill and it is learned, as any other practical skill, by 
imitation and practice. I shall try to do my best for the reader who is anxious 
to learn plausible reasoning, but what I can offer are only examples for 
imitation and opportunity for practice. In what follows, I shall often discuss 
mathematical discoveries, great and small. I cannot tell the true story how 
the discovery did happen, because nobody really knows that. Yet I shall 
try to make up a likely story how the discovery could have happened. 1 
shall try to emphasize the motives underlying the discovery, the plausible 
inferences that led to it, in short, everything that deserves imitation. (I, vi] 
... Each chapter will be followed by examples and comments. [I, vii] .. . In 
order to provide (or hide) such clues with the greatest benefit to the in- 
struction of the reader, much care has been expended not only on the con- 
tents and the form of the proposed problems, but also on their disposition. 
{I, vii] . . . In short, I tried to use all my experience in research and teaching 
to give an appropriate opportunity to the reader for intelligent imitation 
and for doing things by himself. [i, vii] . . . The examples of plausible reason- 
ing collected in this book may be put to another use: they may throw some 
light upon a much agitated philosophical problem: the problem of induc- 
tion. The crucial question is: Are there rules for induction? Some phi- 
losophers say Yes, most scientists think No. In order to be discussed profit- 
ably, the question should be put differently. It should be treated differently, 
too, with less reliance on traditional verbalisms, or on new-fangled formal- 
isms, but in closer touch with the practice of scientists. [I, vii] ... And so 
the door opens to invesiigating induction inductively. [I, viii] .. . 

It is more philosophical, I think, to consider the more general idea of 
plausible reasoning instead of the particular case of inductive reasoning. 
It seems to me that the examples collected in this book lead up to a definite 
and fairly satisfactory aspect of plausible reasoning. Yet I do not wish to 
force my views upon the reader. In fact, I do not even state them in Vol, I; 
I want the examples to speak for themselves. The first four chapters of 
Vol. II, however, are devoted to a more explicit general discussion of 
plausible reasoning. There I state formally the patterns of plausible in- 
ference suggested by the foregoing examples, try to systematize these pat- 
terns, and survey some of their relations to each other and to the idea of 
probability. [I, viii]... 

This work on Mathematics and Plausible Reasoning, which I have always 
regarded as a unit, falls naturally into two parts. [I, viii] ... For the con- 
venience of the student they have been issued as separate volumes. Vol. I is 
entirely independent of Vol. II, and I think many students will want to go 
through it carefully before reading Vol. II. It has more of the mathematical 
“meat” of the work, and it supplies “data” for the inductive investigation of 
induction in Vol. II. [I, viii]...I have not provided an index, since an 
index would tend to render the terminology more rigid than it is desirable in 
this kind of work. [I, ix)... 

The present work is a continuation of my earlier book How to Solve It. 
The reader interested in the subject should read both, but the order does 
not matter much. [I, ix)... Yet there are indirect references to the former 
book on almost every page, and in almost every sentence on some pages. 
[I, ix]... The present book is also related to a collection of problems in 
Analysis by G. Szegé and the author (see Bibliography). [I, ix]... 

Some knowledge of elementary algebra and geometry may be enough to 
read substantial parts of the text. Thorough knowledge of elementary al- 
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gebra and geometry and some knowledge of analytic geometry and calculus, 
including limits and infinite series, is sufficient for almost the whole text and 
the majority of the examples and comments. [I, xi]... The advanced 
reader who skips parts that appear to him too elementary may miss more 
than the less advanced reader who skips parts that appear to him too com- 
plex. [I, xi] .. . Some of the problems proposed for solution are very easy, 
but a few are pretty hard. [I, xi]... 

Let us note: two persons presented with the same evidence and applying 
the same patterns of plausible inference :nay honestly disagree. [II, lll]... 

The trouble with the concept of the “credibility of a conjecture” is that 
we do not know any operational definition for it. [II, 117]... It could be 
that there is no reasonable decision, no reasonable way to say which evidence 
is stronger than the other. This possibility is so important that it deserves 
a name. If there is no reasonable way to decide which evidence is stronger, 
E, or Es, let us call £; non-comparable with E;. (II, 137]... 

Let us note this striking difference that separates the problem-solver’s in- 
vestigation of the workability of his plan from the inductive investigation 
of a mathematical or physical conjecture, or from the judicial investigation 
of a charge: it is the difference between a changeable, or fleeting, and a 
determinate, relatively well defined object. [II, 154). 


The book is accurately and beautifully printed. The occurrence of “equi- 
latera” in Example 3, p. 161, is the only misprint I noted. I saw no mathe- 
matical mistakes, though there is one extramathematical point, Example 
(3), pp. 85-86, which is misleading at best. In this example Polya and his 
bank agree that Polya’s balance is $331.49. How good is the evidence that 
they are both right? Polya attempts to adduce an answer in terms of the 
probability, 10-5, that two five digit numbers chosen uniformly at random 
will agree. But, of course, there is no real possibility that Polya or his bank 
did choose at random. If they did each make an error, the probability that 
they both-made the same one, or equivalent ones, may well be reckoned as 
much larger than 10-5, 

Let me end on the right note, commending the book to one and all for 
pleasure and profit. 


Scientific Explanation: A study of the function of theory, probability and law 
in science. Richard Bevan Bratihwaite. Cambridge: University Press, 1953. Pp. 
xi+376. $8.00. 


Epwin B. Witson, Office of Naval Research, Boston 


ane Braithwaite holds the distinguished chair of Moral Philos- 
ophy in the university of Cambridge in succession to C. D. Broad. Being 
a philosoper he must naturally be better acquainted with philosophy than 
with science and cannot fail to use language carrying connotations unna- 
tural to the scientist and to make statements about science which jar a 
scientist. On the other hand the professional scientist who undertakes to 
write on the philosophy of science without an intimate familiarity with 
philosophic ideology and nomenclature cannot fail to shock the philosopher 
by exhibitions of philosophical naiveté. 

A few like Herbert Dingle or Victor Lenzen may be at home in both science 
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and philosophy, diverse as the philosophies and sciences are, but I am not 
one of them. I may note, however that the philosophies have never been 
unified, that despite much talk the sciences have never been unified, and 
that actually no single science which is alive has been unified. A philosopher 
writing about scientific explanation may be illuminating to his fellow philos- 
ophers but it cannot be expected that he will be any more enlightening to 
scientists than one of them would be to philosophers if he chose to write 
on philosophic explanation. 

I will note first that the name of Bridgman does not occur in the index and 
that I can find little evidence in the book that the author places much weight 
upon the operational concept or upon the “philosophy” of the experimen- 
talist or observer. “Scientific Explanation” appears to be the philosophy of 
the philosopher and mathematician rather than of the scientist. This is 
usual, I must admit, but it is onesided and how hungry scientists are for a 
bit of the other side may be seen from the reception accorded to Bridgman’s 
Logic of Modern Physics. 

The author states (p. 51): “Even if it be granted that the phrase ‘hydrogen 
atom’ has a straightforward meaning which can be understood by itself, it 
cannot be pretended that the words ‘electron’ and ‘proton’ can be properly 
understood without reference to the ceductive system of physics in which the 
propositions expressed by means of them occur.” When I was in charge of 
the department of physics at the Massachusetts Institute of Technology 35 
years ago I was teaching that matter was electrical in nature (the neutron 
had not been discovered), that is, | was putting down as what the author 
calls the highest level hypothesis about matter that it was made up of pro- 
tons and electrons, and I was doing this largely on the basis of the experi- 
mental work in the great Cavendish Laboratory. As physics, the author’s 
statement is unintelligible to me; it may be good (oldfashioned) chemistry— 
the highest level hypotheses of different sciences may be different and as a 
matter of fact the highest level hypotheses of different parts of the same 
science may be contradictory. Equally unintelligible is his remark (p. 275): 
“On Keynes’s theory inductive confirmation serves only to increase the 
probability of a hypothesis by multiplying it by another probability . . .”— 
so indoctrinated am I with the mathematical notions that probabilities are 
proper fractions and that the product of two proper fractions is less than 
either. Moreover, I cannot understand either the text or the footnote (p. 
247) about the exactness of the probability of a representative point falling 
outside a confidence belt in the discontinuous (point-binomial) case. 

One should note that (p. 8) the author states: “The publicity of its data 
is therefore not used (as many writers would wish to use it) as the hall-mark 
of science.” He is concerned with the private date of psychology and the 
social sciences. During the 15 years I was active in the Social Science Re- 
search Council we considered that such private data did not become part 
of science until somehow communicated to others, and then they had to be 
accepted with great caution. The difficulty of obtaining the communication 
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without influence of observer on observed and vice versa led Bohr long ago to 
suggest that his principle of complementarity developed to cover the inevj- 
table interactions between the behavior of atomic objects and the measuring 
instruments applied to them probably applied in some form to psychological 
science, but I believe the author does not examine this suggestion. 

Professor Braithwaite’s book is divided into eleven chapters, of which the 
first four deal with the logic of deductive systems. This is an important as- 
pect of scientific explanation and is well done; had it been available to me a 
half century ago when I was teaching symbolic logic at Yale I should have 
been delighted to include some of it in my course as a fruitful application. 

The next three chapters deal with probability and statistics, and pre- 
sumably are those which justify a review in this Journal. Suffice it to say 
that the author, while admitting his inspiration from his teacher J. M. Keynes 
and dedicating his book to his memory, abandons Keynes’s “philosophic” 
approach for the mathematical. This shows the extreme logico-mathematical 
slant of the author and may be the best claim his book has to originality 
of presentation for the professional philosopher. But it goes a long way 
toward putting him out as a statistician; for the statistician like the scientist 
has to be concerned primarily with the collection and arrangement of and 
the reasonable inferences from observed data. Some mathematics will surely 
help, too much will as surely hinder him ia his pursuits. I do not disqualify 
as bad statistical workmanship the last work of Yule, The Statistical Study 
of Literary Vocabulary, because he failed to set up a background of mathe- 
matical probability theory for determining the “significance” of his results. 
Really new work ir science and statistics is often if not usually like that. 

The antithesis between statistics and probability goes back to the beginning 
of probability theory, being found clearly stated in a letter of Leibniz to 
Jac. Bernoulli, but Leibniz’s name like Bridgman’s does not occur in Braith- 
waite’s index. The point Leibniz makes is that, although the calculation of 
probabilities is useful, it need not be too meticulous because although Nature 
has her habits, she does not repeat unerringly and your calculations will not 
constrain her to do so, and hence you should be particularly circumspect in 
taking account of all possible circumstances. Most philosophers and the 
continental statisticians Lexis, Bortkiewicz, Charlier, and 8. D. Wicksell 
have followed Leibniz, as did Keynes; Braithwaite does not, he follows 
Neyman and Wald, Morgenstern and von Neumann. Thus he gives his fel- 
low philosophers a chance to see what the most up-to-date work can do for 
them, so far as he is able to make his exposition intelligible to them. 

The final four chapters deal with philosophic matters: The justification of 
induction, Laws of nature and causality, Causal and teleological explana- 
tion, Explantion of scientific laws. Many thousands of pages have been 
written by philosophers on these subjects and I have read a considerable 
number of hundreds thereof without enough enlightenment, without finding 
enough agreement, to justify my commenting. This is no evidence that these 
chapters or the many thousands of pages previously written are of no value. 
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Professionally philosophy is a very ancient and technical discipline. Hairs 
are split very fine and a highly developed special language or special usage 
of language may easily mislead the non-professional, who should keep out 
of the fracas. 

I should like, however, to add a reference to an essay on scientific method 
which I believe has been insufficiently cited. It is to the presidential address 
of T. C. Chamberlin on “The Method of Multiple Working Hypotheses” 
printed in the Journal of Geology, 5, 837-48, reprinted Ibid., 39, 155-65, 
and again in Scientific Monthly, 59, 357-62. As a contribution to the dis- 
cussion of scientific induction, as a scientist’s statement of how one may con- 
sciously try to follow Leibniz’s advice to take account of all the circum- 
stances, it seems to me to have real merit. 

One of the difficulties of following the mathematical fraternity is that one 
may adopt some of their conventions too literally. They talk a great deal 
about the danger of small samples misleading one to mistake a mere fluc- 
tuation of sampling for a reality of science; they say little about the dangers 
of large samples which may give a highly significant result that is largely 
bias and not reproducible. This difference between “errors of observation” 
and “constant or systematic errors” has been recognized by astronomers for 
a long time. And they have a way of using P =.05 as a standard significance 
level, which is all right if understood, but the businessman, the investor, the 
weather forecaster, the executive, or the card player who waited for that 
degree of significance, would be so out of the game as to be without a live- 
lihood. Theodore Roosevelt is said to have reinarked that an executive who 
made four right decisions out of seven was good—a “confidence” of .556 
instead of .95. For practical purposes we have to take chances with our in- 
ferences, and actually no matter how meticulous our probability calculations, 
we have to take chances on those probabilities. 

However science is always provisional and usually approximate, and thus 
constantly being corrected. I hope that the author’s neatness of proof and 
definiteness of statement does not lead any reader to misconceive this funda- 
mental characteristic of science. The author throws out some warnings; 
may they prove sufficient! 


- 


Transactions of the Symposium on Computing, Mechanics, Statistics, and Par- 
tial Differential Equations, Volume II. Symposium on Applied Mathematics. 
Sponsored by the American Mathematical Society and Office of Ordnance Re- 
search, U. S. Army. New York and London: Interscience Publishers, 1955. Pp. 
216. $5.00. 

Ricuarp Bettman, The RAND Corporation 


HIS volume consists of eleven papers containing invited addresses de- 

livered at the Second Symposium in Applied Mathematics held at the 
University of Chicago, April 29 and 30, 1954. 

The papers of particular interest to statisticians are those by H. O. 
Hartley and J. Neyman, dealing with technical aspects of statistical theory, 
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two papers on numerical analysis by M. R. Hestenes and J. Todd, and a 
paper by P. M. Morse on Operations Research. For those interested ip 
probability theory there are papers by J. E. Mayer on statistical mechanics 
and by W. Feller on differential equations of the type occurring in diffusion 
theory. 

The remaining papers are three in number, a paper by J. J. Stoker on 
stability theory, a paper by F. J. Bureau on partial differential equations, 
and a paper by C. Truesdell on elasticity theory. 

We shall discuss the papers in the above-mentioned order, allocating the 
greater part of our attention to those papers which we feel will be of most in- 
terest to the readers of this journal. 

The paper by J. Neyman, entitled “The Problem of Inductive Inference” 
consists of two parts. The first part is devoted to a discussion of basic con- 
cepts of statistical theory, the frequentist versus the nonfrequentist theories 
of inductive inference. In particular, points of agreement and disagreement 
between Neyman and recent studies of R. Carnap and R. B. Braithwaite 
are discussed. 

The second part of the paper is devoted to a detailed application of these 
ideas to the problem of the determination of the decay rates of certain phys- 
ical particles called neutral V-particies. 

The paper concludes with a brief summary of recent trends in statistical 
theory, and contains a bibliography of fifty-three papers. 

Following this is a paper by H. O. Hartley, “Some Recent Developments 
in the Analysis of Variance.” The author limits himself to a discussion of 
two topics, multiple decisions and comparisons, and short cut procedures 
based on substitute measures of dispersion. In addition to describing his own 
techniques, he gives brief summaries of related techniques. There is a bibli- 
ography of forty-seven papers. 

We come now to two papers or numerical analysis. The first paper by M. 
Hestenes, “Iterative Computational Methods” centers around the problem 
of developing iterative techniques based upon a minimum principle. The 
gradient and conjugate gradient methods are presented, and a third method 
is given for the problem of matrix inversion. The paper concludes with a 
discussion of some sources of errors, and eigenvalue problems. 

John Todd’s paper, “Motivation for Working in Numerical Analysis,” 
is an interesting and encyclopedic survey of some of the difficult and im- 
portant problems that arise in the supposedly routine task of computing 
numerical solutions. Todd discusses in turn optimal procedures for evaluat- 
ing polynomials, increasing the rapidity of convergence of sequences, inter- 
polation problems, characteristic roots of matrices, quadratures, and game 
theory. At the conclusion of the paper, he discusses some of the new explora- 
tory trends in numerical analysis. There is a bibliography of seventy-five 
papers. 

An expository article on “Operations Research” is contributed by P. M. 
Morse. He discusses a number of mathematical tools that have proved useful 
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in operational analysis and the types of problems to which they have been 
applied. 

We consider next two papers in the domain of mathematical probability 
theory. In his paper “Two Unsolved Problems of Statistical Mechanics,” 
J. E. Mayer presents two of the outstanding difficulties of the theory. The 
first is the curse of dimensionality, the fact that sensible models of molecular 
phenomena seem to run into problems involving impossible numbers of 
variables, say 10%. The second occurs in the theory of rate processes where 
the difficulty is more primitive, namely one of logical formulation. 

The second paper in probability theory is by W. Feller, “On Differential 
Operations and Boundary Conditions.” The general problem is that of con- 
structing a theory of the second order differential operator with general 
boundary conditions and not necessarily continuous coefficients. The con- 
nections with diffusion theory and the theory of semi-groups is discussed. 

The longest paper in the volume, some fifty-five pages, is a detailed account 
of the Hadamard theory of the finite part of a divergent integral by F. J. 
Bureau, entitled, “Divergent Integrals and Partial Differential Equations.” 
A number of applications, developed by the author, are given. 

The last two papers are in the field of mechanics and mathematical physics. 
J. J. Stoker discusses a number of different formulations of the concept of 
stability, and C. A. Truesdell discusses a topic in elasticity theory. 


An Introduction to Linear Programming. A. Charnes, W. W. Cooper, and A. 
Henderson. New York: John Wiley and Sons, Inc., 1953. Pp. ix, 74. $2.50. Paper. 


F. A. Ficken, University of Tennessee 


» RECENT years many problems of practical interest have been formulated 
so as to require an extreme value of a linear function subject to linear 
conditions (equalities or inequalities) on the variables. In a “transportation 
problem,” for example, factories j (j=1,---, ») with products p; and 
warehouses a (a=1, +--+, m) with capacities wa are given, along with unit 
shipment costs ca; to a from j, and it is required to find shipments s.; 20 
such that 2j8aj=Wa, La8aj=p;, and the total shipping cost ZajCaj8a; shall be 
minimum. The term “linear programming ” has come to be applied to the 
formulation and (more specifically) the solution of such problems. 

A particular method of arranging the calculations leading to a solution, the 
“simplex method,” seems to have conspicuous advantages over other 
methods. The book under review is an explanation of the simplex method 
from first principles; no other such explanation, so far as the reviewer knows, 
has been offered by a commercial publisher. 

The authors point out (p. 38) that a single code for the simplex method on 
a digital computer will handle, beyond the problem mentioned above, ma- 
trix inversion and related processes, solution of systems of linear inequalities, 
and solution of a zero-sum two-person game (there being in fact an intimate 
connection between linear programming and game theory). Other connec- 
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tions with statistics are also indicated. A less direct but powerful motivation 
for statisticians lies in the fact that consumers of statistical wisdom seem 
likely also to have problems which they may wish to have assessed with 4 
view to linear programming. 

The first half of the book, by Cooper and Henderson, is entitled “An Eco- 
nomic Introduction to Linear Programming.” The simplex method is ex- 
plained and illustrated in detail, with calculations, in terms of an “input- 
output” problem (specifically, a nut-mix problem). Various questions arising 
in the course of the calculations are fully discussed, and there is much valu- 
able interpretative comment. The distinction between “slack” and “arti- 
ficial” variables emerges clearly, and there is a detailed discussion of the 
dual problem. A useful bibliography is appended. 

The second half of the book, entitled “Lectures on the Mathematical 
Theory of Linear Programming,” consists presumably (in view of the foot- 
note on p. 41) of notes on lectures by Charnes. Starting with the necessary 
facts from linear algebra and geometry and from the theory of convex poly- 
hedra, a reasonably complete theoretical account of the simplex method is 
presented. The troublesome question of degeneracy is treated in detail by a 
perturbation method. There is a brief section on duality and one on the re- 
laxation method of solving a system of linear inequalities. The reviewer's 
feeling that the second half is rather less satisfactory than the first is prob- 
ably a matter of expository taste. There are a few notational anomalies and 
several misprints which should not, however, cause serious trouble. 

The preface suggests selections of material for various classes of readers. 
The book has no index. 

The volume suffers, in the opinion of the reviewer, from a lack of coordina- 
tion between the two parts. The authors of the first half find themselves in 
need of standard terminology, notation, and results, and must either refer 
forward to the second half, as they do for “vectors,” or develop the material 
themselves, as they do in their section on matrices. Many important topics 
are discussed twice. It might be more economical, and perhaps more en- 
lightening as well, to give at the outset a rudimentary survey of linear alge- 
bra, and perhaps even a few of the special results used later, in such a way 
as to provide a linguistic and conceptual framework adequate for both the 
economic introduction and the later mathematical theory. 

Some readers might find it helpful to note that the operations performed in 
passing (by “the algorithm”) from one tableau to another are precisely those 
used when one has a system of linear equations and has selected a particular 
equation (say the a-th) and, in that equation, a particular variable A; whose 
coefficient x,.; does not vanish, and one wishes to eliminate \; from all other 
equations; one divides the equation by 2.; in order to give A; the coefficient 
unity, and then, in order to eliminate A; from the 6-th equation, one multi- 
plies the a-th equation by 2g; and subtracts from the 8-th equation. The 
“algorithm” becomes immediately intelligible in terms of this familiar opera- 
tion. 
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Industrial Process Control by Statistical Methods. John D. Heide. New York: 
McGraw-Hill Book Co., 1952. Pp. ix, 297. $6.00. 


Epwin C. Harrineton, Jr., Monsanto Chemical Co. 


HE publication of a technical book, theoretically at least, should fill some 
Tes in, or extend the frontiers of, our expanding knowledge. Industrial 
Process Control by Statistical Methods is quite definitely designed to fill the 
gap created by the operating man in the factory, who is anxious to apply 
the methods of SQC, but is without formal training in mathematics beyond 
high school algebra. For such a person, this book should have considerable 
appeal. The writing is lucid, the logic is simple, and the organization of 
material reflects the thought and the experience of one who has struggled to 
inculcate the concepts of SQC in the operating levels of a large industrial 
concern for many years. 

To the practicing statistician or even to the engineer with a minimum of 
training in statistical quality control, the book will not be as satisfying as 
many which are addressed to a beginning, but intelligent audience. Except 
for the method of presentation (which is good) there is little which has not 
already been discussed elsewhere. The content, unfortunately, stops short of 
such important aspects of SQC as the theory of runs, the principles of sam- 
pling, and basic probability theory. The author has weakened the book, in 
his attempt to avoid mathematical hurdles, by limiting the discussion of 
probability to the empirical derivation of the normal and binomial distri- 
butions. The central limit theorem is recognized but neither stated nor dis- 
cussed, and such facets of applied statistics as the Tchebycheff Inequality 
are omitted completely. A fact which speaks for itself is that the denominator 
for the standard deviation is “n.” 

Nor can this reviewer feel that the book is, as its Preface claims, of 
“_..value for college use by students planning to enter industry.” There 
is no question that, lacking any kind of statistical training, they would find 
value in the book, but it would not be acceptable at college levels, nor, I 
think, did the author really intend it to be. 

Several chapters are devoted to the psychological impact of SQC on a 
conservative management and labor force in an established plant. Words of 
warning and techniques of persuasion are well taken, and these sections could 
be of great value to the harassed and potentially valuabie engineer. 

The book is not alone in covering much of this ground in low gear, and with 
its publication it is likely that prospective authors and publishers will con- 
sider the gap into which this book falls quite satisfactorily filled, certainly 
until a significantly different approach justifies further publishing effort. 


Le Controle Statistique des Fabrications. R. Cavé. Paris: Eyrolles, 1953. Pp. 432. 
3950 fr. 
Anpre G. Laurent, Michigan State University 
[: 1s the feeling of the reviewer that, in publishing the present book which 
is hastily written and whose presentation contrasts with the usual 
thoroughness of the author, Cavé had in view securing a wider diffusion of a 
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previous work he published three years ago in several French periodicals, 
The compilation of additional material has enlarged the original paper (about 
thirty pages of the text) to the size of a book and explains to some extent the 
lack of unity of the treatment. 

The First Part (74 pages), “Statistical Material,” introduces the reader 
to frequency distributions, elementary notions of probability, probability 
distributions, estimation and testing hypothesis. The author is struggling 
with a desire to be done with the work he undertook and pleads guilty as 
he starts: “The book will give . . . in the first part . . . definitions . . . neces- 
sary alas.” The looseness of expression and the frequent inconsistencies of 
style (which are present throughout the book) will be resented by French 
readers, and they will make the text especially difficult for English-speaking 
readers. 

The following passages provide examples of these defects: “We have 
studied .. . complete populations, that is, [populations] which contain the 
total number of individuals which constitute them.” “The occurrence of a 
given value [of a random variable] is called a random event. One sees the 
analogy with the character defining a population, but, while we can freely 
choose the value z of a character X of a distribution, the value of a r.v. is 
determined by chance.” (p. 37). “[If] Pr[Zi<a< Z2]=1l—a - -- there area 
chances that the interval - --[Z:, Zs] --~- will contain - - - [a]” (p. 75). 
“Fisher’s theorem is satisfied” (p. 263, 303) means that it is legitimate to 
apply Fisher’s theorem. Quoting further examples would be tedious. 

Though the book is intended for the practical man, a more serious attempt 
could have been made to illuminate the underlying principles and the fun- 
damental concepts of statistical methods. Nothing is said about sampling; 
neither Bernoulli’s theorem nor its implications are mentioned; the defini- 
tion of probability is the circular one, and there is no bridge between the 
discrete and the continuous case. The problem of estimation is introduced as 
follows (p. 67): “Statement of the problem—We have already defined several 
parameters summarizing the data of a table . . . , assuming that all the mem- 
bers of the population are known. But, generally the set of n measurements 
so obtained ...is a sample drawn from a much larger population of NV 
values. We shall denote the real parameters by the letters ... we used in 
Chapter I, the estimations we shall be able to obtain from the measurements 
will be denoted by the same letters, with a prime, if they are computed, from 
the measurements, with the formula of definition.” The introduction to test- 
ing hypotheses is even more sketchy: “Suppose one wants to test the validity 
of a hypothesis about a population parameter a. For instance, one wants to 
accept the Hypothesis Ho, if a=ao (a given value) or to test a Hypothesis 
H, against a Hypothesis H2. The operating characteristic curve . . . is that 
curve giving the probability of accepting the assumed hypothesis (for in- 

‘stance @=4ap) as a function of the possible values of a.” The operating char- 
acteristic curves are then classified according to the range of the parameter. 

Several statements are wrong or misleading. A few examples will suffice. 
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“Prime numbers are normally distributed” (p. 54); “The median . . . a value 
with rank n/2” (p. 67); “The median ...a value with rank (n—1/2” (p. 
358); “dF'(x;) - - - called elementary density” (p. 31); “A [maximum likeli- 
hood] estimate [of a] is the root of dL/da=0 with 0*L/d*a <0.” (p. 73); 
“(Pearson’s distributions] satisfy - - - Y’=(a—z)Y/bo+bit+b.2’, x varying 
from a to B (roots of the denominator)” (p. 32); ete. ... 

The Second Part, (76 pages), deals with process control. Chapters V and 
VI present the conventional Shewart control charts. Original features are: 
the introduction of operating characteristic curves for control charts as func- 
tions of the proportion of defective items, clearly illustrated by convenient 
graphs; a realistic discussion of the frequency of sampling; and the emphasis 
on control charts limits based on Tchebychev’s inequality. Chapter VII 
“New Methods of Control” reproduces the material covered in the paper 
previously published by Cavé. (a) A first part expresses the “leading prin- 
ciples of the methods.” The reviewer does not agree with Cavé’s assertion 
that a process is “under control” either if the central tendency and the dis- 
persion of the process do not vary too much or if they vary in such a way that 
the proportion of defective items is still acceptable. In the reviewer’s opinion, 
the soundest feature of Shewart’s contribution has been to illuminate the 
methodological distinction to be established between the product and the 
machine, the lack of ability of a process to fit the specifications and the un- 
controlled systematic causes which affect the production. The aim of the 
control chart method is not to insure the producer against an unacceptable 
amount of defective items but rather to protect him against the unforseeable 
consequences of a “lack of control.” I would not trust a physician who would 
not treat cancer as long as the pain is endurable. (b) A second part deals with 
Control Charts by variables. In setting up modified X charts control limits 
based on the specifications (for known a), the author proposes to extend the 
“acceptance inspection by variables” procedures to the “process control.” 
He must be congratulated for the very useful charts and graphs he presents. 
(c) A third part is concerned with the “compressed limit gauging” method. A 
plan is devised which mimimizes the amount of inspection (for known a) 
for given a=86 type I and II risks. The efficiency of the method is compared 
with that of contro! charts by variables. A sequentiai plan is set up. Original 
graphs and charts are presented which greatly simplify practical application. 

Part III (46 pages) deals with sampling inspection of infinite lots (the 
hypergeometric distribution is not mentioned in the book). It summarizes 
the standard material on the subject and reproduces a large part of the ma- 
terial already covered in Chapter VII. 

The intent of Part IV, “Statistical Methods in Research” (120 pages) is 
to provide the engineer with standard statistical tests and techniques. De- 
spite an unattractive typographical presentation and some of the defects men- 
tioned above, this aim is reached. This handbook of routine methods thus 
treated by solved exercises (all of them from the field mechanical engineering) 
touches upon a broad variety of topics, including variance analysis, statistical 
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dependence, regression (orthogonal polynomials), and is completed by an 
interesting discussion of the statistical aspects of interchangeability and 
tolerances. 

The engineer will welcome the set of tables which, at the end of the book 
recapitulates the definitions, the notations and the standard methods. Un- 
fortunately it contains some inaccuracies. 

Numerical tables reproduced from English and American publications 
complete the book. There is some carelessness in the way they are presented, 
as far as the layout is concerned; further, the titles have been systematically 
modified and, very often, they do not indicate clearly the content of the 
tables. 

The book will not bring any substantially new material to the American 
reader; it will reach, in France, an audience of industrialists and engineers 
to whom it will provide immediately applicable methods. 


Proceedings Symposium on Operations Research in Business and Industry. 
Kansas City, Missouri: Midwest Research Institute, 1954. Pp. iv+185, $5.00. 
Paper. 


L. H. C. Tiprert, Shirley Institute, Manchester 


foyer is a collection of papers given at a conference, together with a report 
of the somewhat scrappy discussion following each paper. The intention 
of the conference was apparently to tell some 250 industrial engineers and 
executives what operations research is about and the whole collection belongs 
to the class of ephemeral literature. 

There are four general papers: “History and Prospects for Operations 
Research”, by C. West Churchman; “A Discussion of the Major Tools of 
Operations.Research,” by Edward C. Varnum; “Experimentalism, Statis- 
tical Control and Cybernetics in Industrial Management,” by Sebastian B. 
Littauer; and “Operations Engineering, Planning and Research,” by J. E. 
Garrett. Churchman, in discussing whether there is anything new in opera- 
tions research, propounds the following interesting law of scientific growth: 
“Whenever potential applications of the methods of scientific research grow 
faster than actual applications, a new development under a new name will 
take place to fill the gap.” Most of the tools listed briefly by Varnum are 
statistical. Littauer’s paper analyzes the fundamentals of industrial control 
and draws parallels with such things as servo mechanisms. Most of the 
substance of Garrett’s paper is a lively development of the theme that in- 
dustrial experiments should be done properly and that the O. R. team are 
the people to see that this is done. 

The remaining papers deal with particular fields of application of opera- 
tions research or with applications to particular problems. They are: “Oper- 
ations Research Applied to the Office,” by Earl La.am; “Operations Re- 
search—A Boon to Marketing,” by J. H. Davidson; “Production and 
Inventory Control in a Chemical Process,’’ by Russell L. Ackoff; “Computa- 
tional Experience in Solving Linear Programming Problems,” by Wm. 
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Orchard-Hays; “Problems of Traffic and Transportation,” by William 
Prager; “Balancing Cost and Quality in a Department to Provide Lower 
Over-all Manufacturing Costs,” by Sherman Kingsbury; and “Scheduling of 
Batched Operations in Petroleum Products Pipe Lines,” by Tage A. Mor- 
tensen. Some of these papers are slight sketches, some are fairly detailed 
works. Finally, there is a report of a “free-for-all” general discussion in 
which, quite clearly, a good time was had by all. 

This symposium gives the reader a good general idea of the way operations 
research is moving in industry at this time and the practitioner will find a 
few examples given in sufficient detail to interest him. The industrial statis- 
tician who has not come across operations research before will wonder what 
all the fuss is about, since aine-tenths of the methods and problems will be 
familiar to him as applied statistics. Indeed, reading this symposium he will 
be inclined to wonder if operations research consists in calling everything 
a “model”—that word is used overmuch as a piece of jargon! Operations 
research is more than applied statistics; but it has long been recognized that 
the statistician who is to work successfully in any field must be more than a 
statistician. However, let us reflect on Churchman’s law of scientific growth, 
cease to bother about names, and get on with the job. 


“Statistics in Medical Research,” edited by Donald Mainland, 1954, 94 pp., 
Section III of Volume 6 of Methods in Medical Research. Chicago: Year Book 
Publishers. 


J. L. Hopass, Jr., University of California, Berkeley 


H1s brief work consists of fourteen sections; ten by Mainland (introduc- 

tion, chance and random sampling, the planning of investigations, the 
modern method of clinical trial, some undesirable effects of laboratory tradi- 
tion, confidence limits, standards of significance, standard deviations and 
standard errors, non-metrical tests of measurement data, and consultation 
with a statistician) and four by Mainland and Lee Herrera (analysis in re- 
lation to planning, clinical surveys, independent individual, and sample 
sizes). The work is organized as a single entity and will be reviewed as such. 

The material is addressed to the medical research man, and very wisely 
seeks to provide him with “not, primarily, more knowledge of statistical 
tests, but a realization of what modern biological statistics implies through- 
out the conduct of any type of medical investigation.” A few techniques are 
discussed, but only to illustrate concepts and principles. As one result, the 
book makes fascinating reading for the mathematical statistician, or the 
statistician working in some other field, who wishes to learn something about 
the particular nature of statistics in medicine. 

There is a great deal of good sense in the advice given, too little of which 
is to be found in many textbooks and courses on statistical methodology. 
A few quotations will indicate the flavor. “To omit randomization because 
one cannot see clearly how bias could occur is like trusting that glassware in 
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chemistry is clean because it does not look dirty.” “Unless the experiment 
has been designed and conducted in such a way as to avoid bias, do not 
apply a test of significance, or ask a statistician to do so.” “Corrections based 
on experiments with other subjects are always questionable; a sound experi- 
ment is self-contained.” “If there is any possibility of periodic fluctuations ip 
the readings . . . the observation times must be arranged to eliminate bias 
from that source.” “The method of ‘alternation’ (allocation of treatment A 
to the Ist patient, B to the 2nd, A to the 3rd, and so on) is not advisable, 
It too readily allows the manipulation of the serial order of patients by clini- 
cians who wish to steer certain patients to 1 or other of the treatments under 
test.” “Objectivity ... means ... an assessment unbiased by knowledge of 
the treatment that the patient is receiving.” “It still seems necessary to point 
out that the surveying of large numbers will not at all guarantee the removal 
of bias.” “If [treatments] V and W were in use at different parts of the period 
under survey, there is a very considerable risk of difference between patients 
who, according to the records, were apparently comparable.” “Great ef- 
fort at high precision in 1 part of an experiment may be wasted if the pre- 
cision in another part is low.” “ ... it is recommended that the emphasis 
should be placed, not on correlation, but on regression.” “The general rule 
should be not to reject observations without adequate knowledge of obser- 
vational and biological variation.” “.. . if 1 clinical treatment is applied in 
1 hospital ward and another treatment in another, however many patients 
are in each ward there are strictly only 2 independent individuals in the 
experiment.” “Unless ... pairing is based on something more than mere 
hope that the members of a pair would respond similarly to similar treat- 
ment, it is not an advisable technique” “ . . . something may be very highly 
significant, i.e., very unlikely to have occurred by chance, but not be very 
useful in practical application.” “A report should always... state the 
probability P for each result... . The reader can then judge the evidence for 
himself and apply his own standard of significance.” “Whether an observer 
grasps fully the implications of such quantities as the standard deviations or 
not, it is his duty to make clear to his reader what quantities he is using.” 
“Enlargement of sample size is affected by a law of diminishing returns with 
regard to precision.” 

There are reasonably good brief discussions of the rationale of randomiza- 
tion, design of clinical trials, spurious correlation and other pitfalls of clinical 
surveys, unintended dependence of observations, meaning of confidence 
limits, significance tests, and standard errors, and determination of sample 
size. A concluding section instructs the clinician in the abased manner proper 
when consulting a statistician. For example: “If the statistician points out a 
flaw in the research method or an invalid inference, accept his statement as 
unlikely to be wrong, even if you do not understand it.” 

There are inevitably in such a work a number of more or less minor 
and more or less technical points which may strike some readers as unfor- 
tunate, but to emphasize them in a review would distract attention from the 
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purpose of the book. This is not intended as a handbook of methods, but as a 
presentation of a concept of statistics which will unfortunately still be novel 
to many in the target audience. 

(The work under review appears as one of four sections of a volume. Read- 
ers of this Journal may also be interested in Section I, “Some methods of 
studying human genetics,” written by C. C. Li and edited by Antonio 
Ciocco. The other sections are not statistical in nature.) 


Clinical vs. Statistical Prediction, Paul E. Meehl. Minneapolis: University of 
Minnesota Press, 1954. Pp. x, 149. $3.00. 


Davip L. Wauuace, University of Chicago 


N THIS short monograph, the author presents an unbiased survey of 
| some theoretical arguments and empirical data in comparison of clinical 
and statistical methods of prediction. Statistical prediction is taken to mean 
(pages 15, 16) “that the prediction is arrived at by some straightforward 


_ application of an equation or table to the data. . . . The defining property is 
‘ that no judging or inferring or weighing is done by a skilled clinician. Once 


the data have been gathered from whatever source and of whatever type, the 
prediction itself could be turned over to a clerical worker.” The problem is 
of practical importance for (page 7) “ . . . therapists are in shortage. . . . The 
clerk or statistician cannot do therapy; hence it is of the greatest importance 
to ascertain whether the clinician can do a better job of prediction than they 
can. If he cannot, we are wasting his precious time.” 

The book consists of somewhat disconnected discussions of various aspects 
of the conflict between the two methods. “The style and sequence... 
reflect my own ambivalence and real puzzlement ...” (page vi). THere is 
no complete resolution of the controversy or even of the conflicting argu- 
ments contained within the book, yet many problems are clarified, and many 
difficulties resolved. 

The importance of validation of clinical (or any other) predictions is 
emphasized throughout (particularly in Chapters 5 and 10). Attention is 
given to claims that clinical predictions for single individuals are essentially 
different from statistical predictions. Various meanings for validation of such 
predictions are considered, such as the validation of the class of predictions 
by one clinician about patients of one sort. 

A major portion of the book is spent in describing some twenty empirical 
studies which provide comparisons between the success of clinical and statis- 
tical predictions. Defects in the design of the studies (for this purpose) are 
presented along with the empirical results. Common defects are different data 
used in making the two predictions, the use of clearly sub-optimal procedures 
in the statistical predictions, and the use of the same sample for establishing 
a prediction formula as for validation. The over-all results showed the statis- 
tical predictions equal or superior to the clinical predictions in all but one 


study. 
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Several interesting points are raised in the general discussion of the em- 
pirical results. In relation to the practical problem of predicting, should the 
statistical prediction be tested against the prediction of the “best” clinician 
or the “average” clinician? Is the “best” clinician stably such, and can he be 
identified? 

The author notes that the empirical studies are rather similar, and quite 
restricted in the nature of prediction, there being predictions of success in 
training or schooling, recidivism, and recovery from major psychosis. The 
range of alternative predictions is then usually a dichotomy or a simple 
scale, and the behavioral form of outcome is also rather special. 

The discussion at the end of Chapter 10 on the differing nature of pre- 
dictions and the difference between prediction in therapy and straight pre- 
diction links up with the lengthy discussion in Chapter 6 of the logic of 
clinical activity. A main contention here is that when the behavior being 
predicted is complex, sufficient experience to permit construction of actuarial 
tables will not exist, and that the clinician can sometimes arrive at conclu- 
sions through construction of some structural-dynamic hypothesis for the 
individual. A success frequency can be obtained for a clinician operating in 
this manner, but it is difficult, if not impossible, to construct a comparable 
statistical prediction. It appears possible that the clinical prediction will be 
best almost solely in those situations in which there are no statistical com- 
petitors. 

In Chapter 9, “Quantification of Clinical Material,” the author discusses 
the frequent need for nonlinear statistical procedures. He defines abstractly 
the concept of “patterning” of variables for the benefit of statisticians as the 
nonvanishing of mixed partial derivatives of the predictive function. The 
definition can be given in terms of interactions. The system may be described 
as patterned of order k if all (k+1) variable interactions vanish but at least 
one k variable interaction does not vanish. An unpatterned system is one for 
which all interactions vanish. 


The Statistics of Bioassay: with special reference to the vitamins. Chester /. 
Bliss. Reprinted with additions from Vitamin Methods, Volume II (Paul 
Gyérgy, ed.), New York: Academic Press, Inc., 1952. Pp. iii, 445-628. $3.50. 


JEROME CoRNFIELD, National Institutes of Health 


HE term “bioassay” refers to a method of measurement in which one 

estimates the amount of a substance present in a preparation on the basis 
of its biological effects. It is to be distinguished from a physical or chemical 
method of measurement solely by the fact that the measuring instrument 
used is a living organism and may consequently be subject to a considerable 
amount of variation. Even though alternative physical or chemical methods 
may exist, a doubt as to whether they are measuring that component re- 
sponsible for the biological effect of interest makes the use of methods of 
biological assay often inescapable. As long as the major problems of bio- 
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logical research were qualitative, use of these methods was restricted to a 
small number of fields, but with the increasing emphasis of recent decades 
on quantitative problems, bioassay methods have some into their own. Some 
of the newer areas in which bioassay methods provide a useful and in some 
cases indispensable tool of measurement, in addition to the traditional one 
of biological standardization, are virology, enzyme kinetics, endocrinology, 
protein metabolism, and vitamin research. 

The systematic substitution of statistical methods for rule of thumb pro- 
cedures in the field of bioassay began in the thirties, although there had been 
previous scattered applications. The statistical problems involved are far 
from trivial. The central problem, from which all others stem is: given (a) 
that the administration of a known amount of substance, X, is followed by 
some quantitative, but nevertheless variable, response of magnitude Y, 
(b) that the average value of Y over all pessible samples is functionally re- 
lated to X, and (c) that the administration of a preparation containing an 
unknown amount of this substance results in a level of response Y’, then, 
what is an appropriate estimate, X’, of the unknown amount of the sub- 
stance in this preparation and within what limits may the estimate be 
trusted? Rule of thumb procedures will take one only so far in a problem of 
this type, so that increasing numbers of research workers have felt the need 
of familiarizing themselves with the applicable statistical methodology. 

The present volume, originally published as a chapter in Volume II of 
Vitamin Methods, edited by Paul Gyédrgy, is now one of several texts de- 
signed to meet the research worker’s needs in this respect. Its author has 
been a distinguished contributor, both as a biologist and as statistician, to 
the theory and practice of bioassay for the past twenty years and is entirely 
conscious of the difficult problems of exposition involved. He has chosen to 
meet them by concentrating on the computational aspects of statistical 
methodology, and at the very outset warns his readers that, “it will not be 
possible to develop fully the biological logic of each procedure and its mathe- 
matical basis not at all.” The rerder who conscientiously works his way 
through all the numerical examples Bliss gives will absorb in the process a 
number of fundamental statistical ideas which will stand him in good stead 
in the practice of bioassay. Whether there are any easier ways of realizing 
this obje::tive is a question on which almost every statistician has an opinion 
and almost none has any evidence. 

This reviewer missed a discussion of the logic of bioassay. On first exposure 
to the bioassayer’s argument one is inclined to dismiss it as a particularly 
gross form of the fallacy of the undistributed middle. At first blush it seems 
as if the argument is: (a) this substance, say insulin, causes a particular bio- 
logical response, say a fall in blood sugar; (6) the unknown preparation in 
which I am interested also causes a fall in blood sugar; (c) it therefore con- 
tains insulin. The bioassayer is in fact more sophisticated than this. He 
knows that as a matter of pure logic his conclusion is correct only if there 
are no other substances in his unknown with direct or indirect effects on 
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blood sugar. If he seriously doubts the validity of this assumption, he wil] 
investigate the effect on blood sugar of those substances other than insulin 
that he might reasonably expect to be in his unknown. If one or more of the 
substances investigated has important effects he will attempt to remove it 
before assaying for insulin. He will become impatient, however, if the possi- 
ble mischievous effects of other uninvestigated substances are pointed out. 
The abstract possibility of error does not interest him. In this respect the 
bioassayer is like any other experimental scientist. He is interested in detect- 
ing and eliminating sources of error that might reasonably be presumed to 
be present; he is not interested in errors that could conceivably be present, 
but in whose existence he has no reason but formal logic to believe. 

Nevertheless the assumption that there are no other disturbing sub- 
stances in the unknown may never be strictly true, and in some important 
cases (e.g., the assay of ACTH), can be grossly untrue. To appreciate truly 
the havoc this can and does create one need only imagine the present state 
of physics if the ordinary balance were sensitive not only to the mass of the 
object being weighed but to a series of extraneous characteristics as well, 
such as its shape, chemical composition, internal energy and color, and if, 
in addition to this, repeated measurements on the same object were 100 
times as variable as they now are. The statistician may argue that such 
matters, while interesting and important, are no concern of his, and he may 
perhaps be right. But if these matters are not discussed in statistics texts, 
where is one to find them considered? 

Bliss emphasizes the prevailing point of view that each assay should be 
self-contained, and in particular that the dose-response curve for the stand- 
ard preparation should be determined anew in each assay. This reviewer 
would not endorse this position without qualification. Even when the dose- 
response curve for the standard is known to vary “significantly” ffom assay 
to assay it is by no means impossible that the use of a previous well-deter- 
mined standard curve will give a more precise estimate of the potency of an 
unknown preparation and that the degree of precision can be validly esti- 
mated using previous data on variation from experiment to experiment. The 
decision to allocate half of one’s experimental material to yet another de- 
termination of the dose-response curve for the standard should clearly de- 
pend on the magnitude and not the mere existence of variations in the level 
of this curve from experiment to experiment. 

There are other aspects of the subject on which one is tempted to write 
either a concurring or dissenting opinion, but to do so would give an entirely 
unbalanced idea of the book. We simply enumerate therefore the following: 
(a) although the Latin Square is a very useful experimental design in bio- 
assay, methods of analysis that are appropriate to the argicultural field 
trial may not be appropriate in bioassay. In particular, it may not be neces- 
sary as it is in the agricultural field trial to select the Latin Square at ran- 
dom, and in consequence the appropriate error terms need not be the same; 
(b) Bliss’ use of the same symbol, lambda, to stand for the ratio st. dev./ 
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slope in an assay with a continuous response and for the ratio unity/slope in 
an assay With all-or-none response is unfortunate, since if an assay with con- 
tinuous response and one with all-or-none response are characterized by the 
same value of lambda as defined, then they will not have the same inherent 
precision; (c) it would be desirable to mention the existence of other methods 
of handling the all-or-none response then the probit method. 

But to dwell any further on matters such as these would be misleading. 
J. H. Burn, writing in 1930, said that, “Biological assay, as carried out by the 
majority of workers in the world, still remains a subject of amusement or 
despair, rather than for satisfaction or self-respect.” Twenty-five years later 
the situation has vastly improved and at least partly responsible for this are 
expositions such as the present in which instructions on computation pro- 
cedures have been used as the vehicle for conveying statistical ideas. The 
book deserves a wide audience. 


Demographic Yearbook, 1954. Sixth issue. United Nations, Statistical Office, 
New York: 1954, pp. 729. $7.50 cloth, $6.00 paper. 


Kurt Mayer, Brown University 


CINCE 1948 the Statistical Office of the United Nations has published a 
» Demographic Yearbook containing regular series of comprehensive and 
aystematic international population statistics. Because of the large and in- 
creasing amount of new data available each year, the editors decided at the 
outset to adopt two major modes of presentation. Certain tables, regarded as 
basic or “standard,” are repeated in each issue with appropriate revisions 
and the introduction of new information. Other subjects and special topics 
are rotated, being featured only in certain years. Every volume contains a 
very useful cumulative subject index which makes it possible to . ‘etermine 
quickly what subjects and time periods were included in specific issues of the 
Yearbook. 

The basic tables which are repeated in every volume are concerned with 
censuses and estimates of the total population of over 200 geographic areas, 
with population density in these areas, with age and sex composition, and 
with basic vital statistics: births, deaths (including infant mortality), and 
marriages. Among the subject matter which is rotated are tables relating to 
marital status, economic characteristics, fertility, divorce, life tables, and 
international migration. In the 1949-50 volume the topics receiving special 
emphasis were marriage and fertility; in 1951 the spotlight was on mortality. 
The 1952 issue was devoted mainly to the geographical distribution of the 
population, including a special chapter on urbanization trends. The 1953 
volume was limited to the basic demographic tables only, but the 1954 
issue again presents a special emphasis, with natality the featured subject. 

The 1954 volume is the most extensive issue published to date. It contains 
a total of 44 separate tables. 20 tables deal with statistics of live birth or 
stillbirth, and 8 of these are entirely new. The introductory and explanatory 
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chapter “Technical Notes on the Statistical Tables” has also been expanded’ 
In every issue this section explains the scope and geographic coverage of the 
tables presented and provides a critical evaluation of the quality of the data 
both in a general sense, and specifically for each table. Like the preceding 
issues, this volume also contains a bibliography of recent official publications 
containing census data and other demographic statistics for each area of the 
world. The work concludes with a country and subject index of the tables 
contained in this volume. and the cumulative index mentioned earlier. 

There is no doubt that the current issue, like its predecessors in this series, 
constitutes a most valuable reference work both for the specialized scholar 
and for the general reader interested in comparative international population 
statistics. But looking back over the six volumes which have so far appeared 
in this series, it is perhaps timely to remind the editors of a hope, expressed in 
the introduction to the first volume, of including some of the many topics 
omitted from the first compilation by means of the rotation system. In 
quite a few cases this hope has not yet been fulfilled: among the important 
items which have remained consistently absent from the Demographic Year- 
book are data on race, language, and religious affiliation, for which existing 
statistics seem adequate enough to warrant their inclusion. The present re- 
viewer is also looking forward to a reappearance and extension of the tables 
containing economic and labor force statistics, not included in any of the 
last 4 volumes. In general, a somewhat one-sided emphasis on purely demo- 
graphic data at the expense of statistics pertaining to the social characteris- 
tics of the population seems to be the major shortcoming of this otherwise 
excellent series. 


Population Statistics and Their Compilation. Hugh H. Wolfenden. Chicago: The 
University of Chicago Press, 1954. Pp. xxiii, 258. $7.50. 


AnsLeY J. Coase, Princeton University 


PULATION Statistics and Their Compilation was first published in 1925. 

The book here under review is the second edition. The original chapters 
have all been expanded and amended in response to developments since the 
first edition, and two new sections (on forecasting mortality rates and on 
the theory of reproductivity) have been added. There is also a new appendix 
by W. E. Deming on “Some Theory in the Sampling of Human Populations.” 
Wolfenden is a distinguished actuary, and his book seems to have been 
written primarily for actuaries. He says in his preface (p. viii) “ . . . special 
attention has been paid to the needs of actuarial and other students... ,” 
but it is clear that the actuarial rather than the other students were upper- 
most in the author’s mind. The central half of the book is occupied by the 
problem of the preparation of mortality tables. 'n most instances even non- 
actuarial topics are treated with emphasis on points of special interest to 
actuaries. For example, in discussing “the reliabiiity of census and registration 
statistics, and the nature of errors therein” (section IV), the author restricts 
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his analysis to misstatements of age and omissions of certain age groups in 
censuses, and to defects in vital registration—the errors which present prob- 
lems for the maker of mortality tables. Misstatements of school grade com- 
pleted, occupation, income, race, etc., are not discussed. The half page 
devoted to discussing * ‘atistics of unemployment emphacizes their usefulness 
in calculating the costs of unemployment insurance—an actuaria. problem. 

Despite the emphasis on the actuarial point of view, the non-actuarial 
reader will find rewarding material in the book. The opening sections on 
population censuses and the registration of vital statistics contain useful 
summaries of the principal methods employed in various countries. One 
of the new chapters contains a succinct description of the most important 
concepts which have been developed for analyzing reproductivity. It also 
summarizes the limitations of these concepts. Not least, the non-actuarial 
statistician or demographer can get from this book, if he is willing to make 
the effort, an authoritative introduction to actuarial techniques and an un- 
derstanding of the principal problems underlying the construction of mor- 
tality tables. 

One of the features of Population Statistics and Their Compilation is the 
large number of references to demographic, statistical, and actuarial litera- 
ture that it contains. Five to ten references per page is not unusual. Thus the 
reader is provided with a running bibliography for each topic. Unfortunately, 
the references are nowhere brought together in a systematic list. Some of the 
references occur in footnotes, most are set in parentheses in the text, and 
many are part of the text itself. An annotated bibliography arranged by sub- 
ject would be a more usable guide to the literature. Deletion of the refer- 
ences from the body of the text would also make material which is difficult 
enough more readable. There is no index, but a detailed analytical table of 
contents. 





Addendum to Demography, Developments during 1949-54. Peter R. Coz, Cam- 
bridge, England: Published for the Institute of Actuaries and the Faculty of 
/-<varies at the University Press, 1955. Pp. viii, 64, 5s. net. 


MortTIMER SPIEGELMAN, Metropolitan Life Insurance Company 


His addendum brings up to date the content of the author’s Demography 

which was published in 1950.* The new material is keyed to the para- 
graphs of the original volume so that it may be read in conjunction with it. 
Apparently a revised volume is in preparation. 

The addendum contains succinct summaries of major British develop- 
ments in demography within the last five years. These include, principally, 
the reports of the Royal Commission on Population and those that have so 
far been published on the 1951 Census of Great Britain. Together with the 
summaries are included comments that have been offered on these reports 





* See review by Frank Lorimer in this Journal, Vol. 47 (1952), p. 349-50. June 1952. 
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by a number of critics. The summaries and comments, taken together, make 
a convenient reference. 

A number of other topics are treated by the author, such as developments 
in vital registration in Great Britain, advances in morbidity statistics, meas- 
ures of infant mortality, and a limited number of recent demographic data 
for Great Britain and the world. 

Although intended essentialiy for actuarial students within the Common- 
wealth, the addenaum and it= parent volume provide, more generally, a 
valuable insight into British demographic thought and practice. 


Cohort Fertility: Native White Women in the United States. Pascal K. Whelp- 
ton. Princeton: Princeton University Press, 1954. Pp. xxv, 492. $6.00. 


Cuar.es F. Westorr, Princeton University 


HE major factors motivating the preparation of Whelpton’s monograph 

are the analytical and interpretive problems created by the interruption 
of the long-time trends of the birth rate in the United States. It is well known 
that annual birth rates and the average size of completed families had been 
on a downward trend in this country for a long period of time, trends that 
were especially evident in the depression of the ’thirties. Many demogra- 
phers, including Whelpton, developed population projections which involved 
the assumption of a continuing decline or stabilization of these low birth 
rates. In the light of the recovery of the birth rate in the ’forties, it was not 
surprising to see how far short these forecasts fell. Whelpton, among others, 
has been reluctant to accept the view that the sharp rise in annual birth 
rates implies a reversal of the trend toward smaller families. He has per- 
sistently maintained that one must apply sharper analytical techniques to 
birth data in order to ascertain whether the recent changes have not been due 
more to “temporary” changes in the time*at which married couples decide to 
have their children, than to any basic alteration in the number of children 
ultimately reproduced. The application of cohort analysis, the calculation 
of the cumulative number of births occurring throughout their reproductive 
period to a group of women born at the same time, represents Whelpton’s 
attempted solution of this problem. 

The monograph accomplishes two primary objectives: the detailed de- 
scription of a technique of fertility measurement (cohort analysis) and the 
presentation of over 100 pages of fertility tables for native-white cohorts 
born in each year from 1875 to 1933. The fertility rates included in the 
first tables are cumulative birth rates by order of birth by age of woman 
(Tables A, B). The next set of tables presents the distribution by parity 
of the women surviving to each age from each initial birth cohort (Tables 
C, D). The following two tables (E, F) present the celculations of the prob- 
ability of bearing a child of n order during each age for the women of n—1 
parity living at the beginning of each age. The series is concluded with two 
tables (HA, HC) which relate to hypothetical cohorts up to the year 1948. 








1955 


ake 
nts 
ata 


on- 
ra 


ph 
ion 
wn 
en 
lat 





poOK REVIEWS 1375 


These latter tables are constructed on the principle of life tables and re- 
production rates. 

The text is based largely on the rates presented in these basic tables. 
The author enumerates in elaborate detail the kinds of biases and errors 
from such sources as underenumeration and underregistration. The whole 
tone of the book, as a matter of fact, is of this nature. Whelpton’s writing 
is characterized by extreme cautiousness and fidelity to detail. 

In substantive terms, the author’s historical analysis reveals the pattern 
of a recent making up of births postponed during the depression, an abnor- 
mally high rate for first births partly as a consequence of high marriage 
rates, and a long-time decline in family size as a result of sharp decreases in 
higher order births. His concluding chapter on the future course of fertility 
suggests that little change will occur in recent patterns of age at marriage. 
His medium forecast for the total birth rate for the cohorts of 1¢45-49 
represents approximately the estimated rate for women of the 1905-09 
cohort, although some intervening cohorts will probably at least temporarily 
interrupt the historical downward trend of comp'eted family size. His fore- 
casts of additions to cumulative birth rates from 1949 to 1963 indicate 
fewer first order and fewer high order births than in 1944—48, and more births 
particularly of third to sixth orders. The general forecast indicates a sub- 
stantial increase of total births up to 1954 and a decrease thereafter. The 
forecasts prepared are not mathematical projections of past trends but em- 
pirical estimates of the most probable events based on the author’s opinions 
of trends in age at marriage, attitudes toward family size, the treatment of 
sterility, and the like. His High, Medium, and Low categories are chiefly 
determined by explicit assumptions of differing future economic conditions. 

From the point of view of contributions to statistical theory, nothing is 
claimed in this volume. The contribution made to the methodology of pop- 
ulation research is contained in the author’s consciousness of the necessity 
of analytical refinement and the use of statistical controls, as well as in his 
critical appreciation of the need for actual measurement through genera- 
tional time. It would be something of an understatement to say that the book 
is difficult reading. This derives not from any inherent complexity of the 
tecnniques described or the analysis presented, but rather from the technical 
nature of the subject itself and from the frequently cumbersome labels 
attached to the rates presented. 

One general criticism that has been levelled at Whelpton’s work deserves 
reiteration here. His attention to completed fertility occasionally results in 
the treatment of other fertility variables, such as annual birth rates, the 
timing of births, and so forth, as variables requiring control or as factors 
tendi: ¢ to obscure “real” trends. Although his conceptualization of such 
factors is appropriate to the analysis of completed fertility, the unfortunate 
picture results that other fertility variables are somehow less legitimate con- 
cerns, a point of view certainly not held by many social scientists and prob- 
ably not intended by the author himself. 
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With respect to his forecasts, Whelpton voices the now standard defensive 
complaint that economists would have more justification in criticizing popu- 
lation projections if their own forecasts were more accurate. It would be more 
helpful at this juncture to declare a cessation to mutual recrimination and 
direct this energy toward collaborative effort. 

There is no doubt that Whelpton has made a basic and significant contri- 
bution to the methodology of demography, both in terms of the research tech- 
niques developed and the construction of useful basic data tables. 


Economic Problems of an Aging Population. Part 2 of Retirement Policies and 
the Railroad Retirement System. Report of the Joint Committee on Railroad Re- 
tirement Legislation. Washington, D. C.: United States Government Printing 
Office, 1953. Pp. xvii, 172. $0.45. Paper. 


Ernest W. Burcess, University of Chicago 


HIS report is issued by the Joint Committee on Railroad Retirement 

Legislation of the Senate and House of Representatives, Paul H. Douglas, 
chairman; Robert A. Wallace, staff director. Its purpose was to present a 
comprehensive survey of the economic problems of the aging population 
derived from over 100 sources by summarizing “existing statistical and other 
data regarding the economic problems caused by aging, generally, in employ- 
ment and retirement.” It was addressed not only to members of Congress 
but to private citizens and public agencies active in this field. It is factual 
and analytical and makes no recommendations. 

A few main findings of this report may be briefly stated. The proportion 
of persons 65 years and over has greatly increased because of the increase in 
life expectancy and of the decrease in proportion of children and younger 
persons. The proportion of older persons in the labor force has steadily de- 
clined. Those still employed have small incomes in comparison with the total 
working population but more adequate than the incomes of the retired. 
Self-employed individuals constitute only 16 per cent of the entire labor 
force but 40 per cent of those 65 years of age and over. Older as compared 
with younger persons are at a disadvantage in having lower incomes, poorer 
housing, higher incidence of chronic and disabling illnesses, more need of 
medical care and less hospitalization insurance. 

Findings of studies are presented showing that older workers compare 
equally and in some instances favorably with lower-age brackets in the 
frequency of disabling illness, in lower accident and absentee rates, less 
labor turnover. The results of a survey of over 3,000 companies by the 
National Association of Manufacturers in 1951 is quoted in which 99 per cent 
of companies reported workers 45 years and over as superior or equal to 
younger workers in work attitude and 93 per cent report them superior or 
equal in work performance. 

The growing importance of retirement is indicated by the doubling be- 
tween 1900 and 1940 of the number of years a man of 60 can expect to spend 
in retirement. 
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The report contains 75 tables and 33 charts. A valuable feature is a digest 
of 108 recent selected references on the problems of aging. It has an excel- 


lent index. 


America’s Resources of Specialized Talent. Prepared by Dael Wolfle. New York: 
Harper and Bros., 1954. Pp. xvii, 332. $4.00. 


Arruur H. Brayrie.p, Kansas State College 


HE wise utilization of human talent is a central problem in a democracy. 
Torgent national concerns must be balanced against the ideal of free in- 
dividual choice of occupation. Special attention to the gifted may en- 
counter public distrust of an intellectual elite. 

The appropriate starting point for any constructive efforts is careful ap- 
praisal based on adequate factual data. This the Commission on Human 
Resources and Advanced Training has attempted to provide in a long- 
awaited report by Dael Wolfle. The Commission, an outgrowth of the expe- 
rience with manpower problems of four national research councils in World 
War II, spent nearly four years and approximately a quarter-million-dollar 
Rockefeller Foundation grant in the development and analysis of relevant 
information. 

This volume summarizes a comprehensive survey of the specialized man- 
power resources and requirements of the United States. The Commission 
was concerned with “those persons who are educated, intellige»:, able to 
work with ideas, and qualified to plan and understand and direct the aation’s 
complex web of industrial, technological, social, scientific, and governmental 
institutions and problems.” Its studies were organized around three inter- 
related problems. First, what is the present supply and what ir ‘t like? 
Second, what are the demands now and what are they likely to be in the 
future? Third, what is the potential supply? 

Among the methodological problems confronting the Commission staff 
at the outset were the necessities for (1) a sharpened definition of specialized 
personnel; (2) a classification o° fields of specialization; (3) criteria for 
determining membership in a field, and, closely related; (4) criteria for 
determining the allocation of jobs to fieids. These problems were resolved in 
a fairly satisfactory manner and represent a substantial contribution to 
manpower studies. 

Where possible, the Commission used readily available data. In addition 
it conducted, or contracted for, first hand investigations. Although an appen- 
dix particularizes some of the methods of data collection and analysis it is 
not possible to evaluate the procedures adequately. The Commission was 
aware of limitations and Wolfle candidly says that “one could well ask for 
improvements in practically every type of statistical finding and estimate 
made in this book.” But the figures “are offered as the Commission’s best 
estimates for the United States as a whole in the middle of the twentieth 
century.” And the fact that earlier fragmentary studies bearing on some of 
these same issues support many of the Commission’s conclusions gives one 
considerable confidence in the major outlines drawn in this report. 
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A detailed account of the findings is obviously impossible. The following 
gives something of the nature of the enterprise. 

The proportion of college graduates in the appropriate age range has 
steadily increased and may be expected to continue. The composition of the 
college graduate population is constantly changing. Smaller proportions 
come from the humanities and the natural sciences. The rapid increases 
have come in education and business administration which now account for 
nearly a third of all bachelor’s degrees! The social sciences show a fairly 
steady relative increase. 

About 2 per cent of bachelor’s degree recipients go on to the doctorate; 
the percentage is slowly increasing. The distribution of doctor’s degrees 
among fields of specialization has been comparatively stable, with the natu- 
ral sciences accounting for the largest percentage of degrees granted. Wolfle 
suggests that the character of the degree may change with “applied” degrees 
increasing. 

Continuance in the field of specialization comes in for extensive consider- 
ation. About one graduate student in three shifts his specialization from his 
undergraduate major. Approximately 38 per cent of employed college grad- 
uates work in the field of their undergraduate major. Shifts are most likely 
to occur among liberal arts and sciences majors. Contary to the frequent 
interpretation of such findings as evidence of the inadequate utilization of 
human resources, the Commission points to a positive social value in the 
form of desirable flexibility in the labor supply. It is interesting to observe 
that economists have been making a similar virtue of labor mobility. These 
are prosperous times. 

Short-run supply and demand projections made by individual areas of 
specialization were based on a variety of sources of information of apparently 
uneven value. Although these projections make interesting reading (unfor- 
tunately statisticians were lumped in with other groups), the attempt high- 
lights the very great need for adequate data. 

It was not thought feasible to estimate the long-range potential supply 
for individual fields, Instead, an analysis was made of the potential for 
college attendance and the factors affecting attendance and graduation. 
This portion of the report is fairly conventional but does bring the documen- 
tation up to date. A major conclusion is that a lot of bright youngsters do 
not get to college, and the important reason seems to be a lack of adequate 
motivation. 

Also documented anew is the frequently observed fact that “students in 
some fields are more highly selected in terms of intelligence test scores than 
are students in other fields.” (Could the great increase in education and 
business administration enrolments be related in part to the finding that 
these students are well toward the bottom of the ability hierarchy?) The 
report neglects the comparatively large literature dealing with the differen- 
tial interest patterns of students entering and continuing in specialized fields. 
It also minimizes the practical use of differential aptitude measurement to 
an extent that disturbs me. 
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The discussion of methods for improving the utilization of the potential 
supply covers familiar ground but has the virtue of bringing various pro- 
posals together in a systematic appraisal of their possible effectiveness. It 
may be news to some that scholarships and financial aid are of limited 
value. 

Although the amassing of comprehensive data of the type presented here 
is a very worthwhile contribution, particularly as it focuses national atten- 
tion on the problem, the reviewer felt “let down” after completing the book. 
A half dozen adequately designed studies directed to specific aspects of the 
problem might have contributed really new knowledge. For example, recent 
advances in the measurement of motivation for achievement make it pos- 
sible to test empirically hypotheses regarding the influence of different 
variables upon such motivation. Controlled experiments with college attend- 
ance as the dependent variable are feasible. Support of a really adequate 
venture in differential prediction with, say, the Yale test battery, has po- 
tentialities. The commission missed a fine opportunity to outline, at least, 
needed research. 

In summary, the Commission’s report has general information of real 
value. It demonstrates the difficulties in collecting relevant data and force- 
fully reminds us of the need for continuing, systematic appraisal. The recom- 
mendations for improved utilization of specialized manpower make sense. 
An emphasis on freedom of occupational choice permeates the entire report 
and is impressive evidence of the faith of the Commission and staff in the 
validity of the democratic process. 


The Uneducated. Eli Ginzberg and Douglas W. Bray. New York: Columbia Uni- 
versity Press, 1953. Pp. xxv, 246. $4.50. 


Z. W. Birnspaum, University of Washington 


ow World War II, more than 700,000 men were rejected by the 
Armed Forces on the grounds that they were “mentally deficient,” and an 
additional 300,000 were rejected on the same grounds in the year following 
the outbreak of fighting in Korea. These startling figures brought into 
focus the basic problem treated in this book: the waste of human resources, 
possibly due to the fact that a sizeable fraction of the population appears 
unable to perform mental tasks which are supposed to be taught in the 
first few years of elementary schooling. 

In the first part of the book entitled “Education and Scciety” an attempt 
is made to trace the history of this defect in the development of elementary 
human resources in the United States. Narrowing it down, somewhat arbi- 
trarily, to “illiteracy,” the authors make use of census data between 1870 
and 1950 to uncover the trends within various social groups and geographic 
areas of the United States. Particularly striking differences are observed 
between the native born white and the Negro population, and between the 
states in the Northeast and Far West on the one side and those in the South- 
east on the other. The rejection data of World War II are then broken down 
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and analyzed in a similar manner according to geographical regions and two 
main groups: white and Negro. Remarkable relationships are observed be- 
tween rejection rates and educational expenditures per pupil, per capita 
income, and percentage of rural population in the different states. 

The second part of the book, under the heading “Military and Civilian 
Performance,” deals with the poorly educated in the Armed Forces during 
World War II. The mobilization plans, prepared against a background of 
the large scale unemployment of the 1930’s, did not contemplate the possi- 
bility of a shortage in the nation’s manpower resources, but aimed at secur- 
ing for the Armed Services the better men out of the total pool. In the 
beginning of the war the Army inducted only men who had “the capacity of 
reading and writing the English language as commonly prescribed for the 
fourth grade in grammar school.” With the steady increase of manpower 
shortage, however, the Armed Forces had to relax these admission standards, 
From 1943 on, a psychological test was substituted for the criterion of four 
years of school or equivalent training, and those who passed the test, even 
if illiterate, were inducted and assigned to Special Training Units in which 
they were to be educated to a level corresponding to at least the fourth 
grade. 

A major effort has been made by the authors to study the effectiveness of 
these Special Training Units, by exploring the civilian background of soldiers 
assigned to such units and their adjustment to civilian life, comparing those 
who were graduated with those who failed to graduate from the STU’s, 
and appraising the performance of the graduates in subsequent military 
service. The approach to this group of problems is methodically sound: 
a sample of individuals assigned to STU’s was obtained, the records for each 
of them were traced down to a remarkable detail, and these records were 
used as material for a statistical analysis. The statistical design for the col- 
lection of data, however, could have been improved. The soldiers in the 
STU’s were classified into eight groups according to the categories: Northern, 
Southern; White, Negro; year 1943, year 1944. A sample of 50 individuai 
records was then obtained from each of the eight groups (by a procedure 
which is not described in detail). While the over-all sample size of 400 may 
be large enough, the allocation of equal quotas of 50 to the eight strata 
regardless of their size can hardly be considered efficient statistical pro- 
cedure. 

One of the salient conclusions reached by the authors in the study of the 
400 individuals is that this substandard human material was to a very great 
extent rehabilitated by the STU’s and re-made into good soldiers. A com- 
parison of STU graduates with a contrel group consisting of soldiers who 
did not require remedial training suggests that the STU graduates performed 
less well than those in the control group but nevertheless the STU’s repre- 
sented a clear gain to the Army. 

The study dealing with the Special Training Units was rounded off by 
following up the graduates of these units after their return to civilian life. 
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The information obtained, although fairly incomplete, seems to indicate that 
a majority of the graduates considered the training received in the STU’s 
as helpful in their lives. 

In view of their finding that the main concentration of uneducated is in 
the South, the authors undertook a study aimed at finding out to what 
extent the problem of those lacking the most elementary education affects 
the supply of labor for industry in the South. The evidence gathered appears 
to present the following picture: the educational facilities are still very un- 
even, and the poorly educated and the uneducated have found consider- 
able difficulties in making a place for themselves in southern industry; 
industrial management in the South, however, is generally satisfied with 
the educational background of its working force, due to the fact that southern 
industry, although growing rapidly in recent times, has lagged behind the 
expansion of southern education and has encountered no difficulty in ob- 
taining all the workers with the required educational background it needed. 

A separate chapter entitled “The Uneducated Migrant” is devoted to 
the problem of the uneducated among those who, mainly due to economic 
pressure, change their domicile and try to make their living elsewhere. 

In the third part, under the heading “Human Resources Policy,” the 
authors attempt to interpret the results of their study in terms of implica- 
tions for public policy. After presenting considerable material on the develop- 
ment of schools in the geographic area which * »y have found to be most 
critical, the South, and carrying out comparisous between the South and 
the rest of the country, they utilize the information so assembled to discuss 
the “limited question of whether the white and the Negro children born in 
the South during the 1940’s and 1950’s, including the more impoverished 
rural areas, now have an opportunity to acquire a basic minimum educa- 
tion.” The authors admit that the evidence is not clear cut. They point out 
that there has been an increased effort made by the southern states to close 
the gap between their rural and urban schools and particularly between 
Negro and white schools, and that the southern states have been raising 
their educational standard rapidly. In spite of these striking improvements, 
some numerical evidence and opinions expressed by various state depart- 
ments of education lead the authors to the statement that “the fact must be 
faced that a considerable, if unknown, number of children, particularly in 
the rural south, and more particularly among the Negroes, will fail to com- 
plete four grades of schooling and acquire basic literacy.” 

In a chapter “The Armed Services” the present practice of the Armed 
Forces of rejecting those individuals who fail to pass the Armed Forces 
Qualifications Test (AFQT) is subjected to a critical discussion. Among 
other things, it is pointed out that by this procedure the Armed Forces may 
be losing a substantial number of potentially good soldiers who could be 
reclaimed by a specialized training program. 

In a concluding chapter, “Human Resources Potential,” the authors 
summarize their findings with the aim of formulating desirable actions for 
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the future. Having localized the focus of the problem in the southern states 
and, to a very much smaller extent, in such groups as migrant workers, they 
reach the conclusion that an effective remedy would consist in Federal 
action directed towards raising the level of education in the economically 
less well-to-do states. This, in their opinion, would attack the evil at the 
source, while such steps as admitting poorly educated individuals to the 
Armed Forces and establishing remedial training for them would help reduce 
its effects. 

The book makes interesting reading and the conclusions are in essence 
well substantiated. One can not help, however, thinking that the quality of 
the presentation and the validity of the findings might have been consider- 
ably better had this large scale investigation utilized some modern statistical 
techniques. Although the book relies heavily on statistical evidence, the 
statistical methods used do not go beyond tabulating percentages. The word 
“significant” is used to mean that some figures appeared to be significant 
te the writer. As already mentioned, the planning of the study of the Spe- 
cial Training Units and the evaluation of its results could have been improved. 
There also are quite a few slip-ups, such as 11.5 per cent being referred to 
as “one in ten” (p. 21), or a reference to a non-existing table (p. 31, sec- 
ond paragraph), or statement “The reader will recall that 38 graduates 
... were so classified” (p. 110) which made this reviewer search for the 
number 38 throughout the book. 

In spite of minor shortcomings, The Uneducated is a thought-provoking 
book. It formulates and discusses a real and important problem and points 
out possibilities of attacking it. It certainly constitutes a constructive ap- 
proach to the area of conservation of human resources and makes the reader 
very much aware of the need for much further work in that direction. 


Towards an Understanding of Juvenile Delinquency. Bernard Lander. New 
York: Columbia University Press, 1954. Pp. xv, 143. $3.00. 


C. C. VanVecuten, U.S. Navy Bureau of Ordnance 


HE basic materials of this study are 8,464 official records of presumably 
different individuals before the Baltimore Juvenile Court over the four 
year period 1939-42, and the 1940 census tract data for the city. 

The problem of definition is recognized. Lander uses the term “juvenile 
delinquent” to mean “brought into court on a delinquency petition.” He has 
no data on the characteristics of census tracts which influence the proba- 
bility that specific acts will result in formal hearings and does not speculate 
on them. 

Statistical techniques include zero order, partial and multiple correlations. 
A chapter is devoted to factor analysis. Parabolic functions are fitted to 
data that appear non-linear. Significance levels are meticulously stated. 

{ In addition to delinquency, the factors used are medians for years of school 
completed and monthly rent and percentages for overcrowding, home owner- 
occupied, home substandard, non-white, and foreign born. 
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Lander is primarily interested in examining the Shaw-Burgess concepts of 
the causation and ecological distribution of juvenile delinquency. This re- 
viewer finds it possible to agree with many of the concepts and conclusions 
without agreeing that they contravert the Shaw-Burgess idealized ecological 
models. The difference seems to center on whether the “Chicago School” 
ecological model is taken as a rigid statement of what “should” uniformly 
exist or as a useful scientific fiction to aid in thinking and valid in its details 
only when other factors are constant. Lander seems to make the former 
assumption and feels he has disproved the Shaw-Burgess contentions when 
he can demonstrate specific exceptions even though the exceptions are just 
what any ecology student would predict given all the facts. 

For instance it is hardly startling that there are high rates contiguous 
to an industrial area incorporated into the city 256 years after its founding. 
Nor is it startling to find that the role of the immigrant has changed now 
that he has been displaced by the Negro as newcomer and low man on the 
economic totem pole. One highly interesting fact, new to this reviewer, is 
the demonstration of a marked curvilinear relationship between delinquency 
and percent Negro with the rate maximum where the ratio is about 50-50. 
Old areas of stable Negro occupancy showed up with very low rates despite 
extremely poor status on the economic indexes. 

One learns a great deal about Baltimore although it is regrettable that with 
13 maps of the city none identifies the tracts so constantly referred to in 
the text. 

Lander finds the delinquency rate fundamentally related to anomie (which 
term he prefers to “social disorganization” as indicating isolation from the 
pressures for conformity within the dominant culture) with socio-economic 
factors in a distinctly secondary role. 

The distinction between cause and covariance is carefully maintained. 
This reviewer has some questions about some of the statistical techniques; 
perhaps because it is not always clear just what happened. (Are all recidi- 
vists counted only in the first year, and only at the original residence? Why 
fit parabolic curves rather than use exponential transformations?) Particular 
exception must be taken to the contention that factors having zero order 
correlations of .69 and .73 have “no substantive relationship”... (with)... 
delinquency” because they have essentially zero partial correlations. This 
involves assumptions of orthogonality and normality in the rest of the data 
that are at best dubious and, as applied to the group of factors, would 
justify the conclusion that the sum of the contributions of the factors studied 
accounted for less than 10% of the variability in delinquency. 

It is regrettable that typographical errors are generously sprinkled 
throughout the book starting on page one. The reader sometimes, as in 
Table XVIII, finds it difficult to supply corrections. 

Despite limitations this book will be a “must” for anyone contemplating 
the use of census tract data to establish basic relationships between ecological 
and sociological phenomena. 
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Occupational Mobility in the United States, 1930-1960. A. J. Jaffe and R. 0, 
Carleton, New York: King’s Crown Press, 1954. Pp. xiv, 113. $2.75. Paper. 


HERBERT GoLDHAMER, The RAND Corporation 


TT primary aim of this study is to provide estimates of the size of the 
United States male working force and its distribution among ten major 
occupational groups for the year 1960. Four projections are provided, two 
on the assumption of depression conditions similar to the ’tiurties sad two 
on the assumption of prosperity conditions similar to the ’forties. J'he two 
projections of each of these pairs are distinquished hy their incorperation of 
different assumptions concerning the number of men in the armed forces. 

Utilizing the 1930, 1940, and 1950 censuses of occupation the authors 
trace the occupational composition of successive age cohorts through these 
time points, each age cohort then being projected to 1960; the major occu- 
pational groups are the summation of the age cohort projections. The 
changes in each age cohort through time are derived from estimates of four 
components: deaths, retirements, new entries, and net mobility (inter- 
occupational shifts). These component projections lead to final estimates 
rather different from and doubtless more accurate than those that might be 
secured from simple extrapolations of past census occupational distributions. 

The civilian male 1960 working force is estimated to range from a mini- 
mum of 45.0 to a maximum of 46.8 million (as compared with 42.3 million 
in 1950). The estimates of working force distribution among the ten major 
occupational groups are relatively insensitive to the two different assump- 
tions on the size of the armed forces. The “depression” and “prosperity” 
assumptions are introduced by assuming for one projection the retirement, 
new entry, and net mobility rates of the thirties and for the other projection 
those of the ’forties. Professional, technical, and kindred workers show 4 
percentage increase in 1960 (over 1950) under both sets of assumptions; 
farmers and farm managers a decrease in both cases; and sales workers and 
operatives virtually no change in both cases. Occupations showing an in- 
crease under depression assumptions and a decrease under prosperity as 
sumptions are: service workers, farm laborers and foremen, and laborers 
except farm and mine, Occupations showing a decrease under depression 
assumptions and an increase under prosperity assumptions are: managers, 
officials and proprietors (except farm), and craftsmen, foremen, and kindred 
workers. 

To provide these estimates the authors had to reduce the 1930, 1940, and 
1950 censuses of occupation to a common occupational classification, a task 
which they have performed with exemplary care, although their success was 
necessarily qualified at some points by the limitations of the census data. 
A considerable part of the volume provides analyses of the separate com- 
ponents that went into the final working force distribution estimates. The 
component data, however, while constituting a proper basis for the final 
working force estimates have relatively little value in themselves. This is 
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particularly the case with the data on net occupational mobility which turn 
out, after closer inspection of the methodological section, to reveal little of 
interest on inter-occupational changes. The authors have, however, taken 
material from Gladys Palmer’s Labor Mobility in Siz Cities (New York: 
Social Science Research Council, 1954) which contains data on the occupa- 
tional changes of specific individuals and reworked these into an inter- 
esting table (Table 13) which shows men classified by major occupational 
group both at the beginning (ages 15-24) and end (ages 55-64) of their 
working careers. This table would have been of still greater interest had the 
initial age group been somewhat older. Among the 15-24 age group are 
many young persons whose initial jobs are of a transient character. To com- 
pare these with the jobs they hold at 55-64 is not of much interest. None- 
theless this table is decidedly useful and does more to justify the title of the 
work than the authors’ lengthier discussions of net mobility. 


Immigration and the Foreign Born. Simon Kuznets and Ernest Rubin. New York: 
National Bureau of Economic Research, Inc., Occasional Paper 46, 1954. Pp. 
xii, 107. $1.50. 


Howarp G. Brunsman, Bureau of the Census 


Ps I of this brief report contains an analysis of the basic trends in the 
foreign-born white population of the United States and in immigration into 
the United States in the period since 1820. The contribution of the foreign 
born to total population and to the labor force, and the cyclical variation 
in immigration and the relationship of these variations to business cycles 
are considered. 

Part II of the report presents a summary statement of the methods used 
in preparing estimates of the foreign-born white population of the United 
States based on the migration-survival method by decades since 1880. 
These estimates are presented by sex and age groups and are compared with 
the statistics obtained in the decennial census. Estimates of number of for- 
eign-born white in the labor force are also presented. The fluctuations in 
these various series are analyzed in greater detail than in part I. 

Part III of the report contains a thoroughly-documented statement of 
the statistical methods followed in evaluating the series, in preparing the 
estimates, and the problems faced in this procedure. 

It is stated in this report that “Restriction of immigration had at least one 
result that may have decreased the stability of the economy... . Swings 
in immigration and in the rate of increase of native-born and native parents 
are roughly synchronous. ... But swings in rate of increase of native- 
born of foreign parents lag a decade behind immigration. ... Because of 
this difference in phase, the net additions to population from the two sources 
together are steadied.” An examination of table 9 of the report shows greater 
stability in the native-born of native parentage than in total population. 
This indicates that the damping effect of the lag in births to the foreign 
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born is not sufficient to offset the greater range of fluctuation in immigra- 
tion. 

The estimates of the number of foreign-born white show basically the 
same level and fluctuations as the figures obtained in the population census, 
This similarity is especially gratifying in view of the quite serious limita- 
tions of the basic data, including the limitations of the census counts for 
the earlier years. 

It is refreshing to examine a brief report with a thoroughly-documented 
statement of the methods followed in preparing the statistical series which 
are presented. Most analysts are impatient to interpret the results of the 
studies and devote but little space to the explanation of how the basic series 
were developed. Progress in the analysis of data is advanced when a student 
is able to build on the work of his predecessors. This cumulative approach is 
feasible only when the student is able to review and evaluate the statistical 
basis of the earlier work. Part III of this report may be commended as an 
excellent example of the proper presentation of the methods and limitations 
of the basic underlying data. 

This is an interesting and workmanlike job and one must agree with the 
authors that “the area of investigation touched by this paper has long been 
slighted and might richly repay more intensive research.” 


British Post-War Migration. Julius Isaac. National Institute of Economic and 
Social Research, Occasional Paper XVII. Cambridge: Cambridge University 
Press, 1954. Pp. xix, 310. $6.00; 30s. 


Wiiu1aM Petersen, University of California, Berkeley 


A ALL who have used them know, international migration statistics are 
not very satisfactory. The definition of the area is vague, since the arbi- 
trary distinction between migrants and visitors leaves many ambiguous cases; 
the count is typically neither complete nor accurate; the classification is 
usually into categories that are more or less irrelevant to an analysis of 
population trends. 

Together with an anonymous “post-graduate research worker,” Isaac col- 
lated the information given on shipping manifests for a ten per cent sample 
of all British emigrants during 1946-1949. This independent set of data made 
it possible for him to discuss intelligently matters not included in une Board 
of Trade passenger returns—for example, the emigrants’ family structure 
and the region in Britain of their prior residence. With these data, he also 
subjected the standard statistics to a thorough critical review, and by the 
time the volume was published some of his suggested improvements had 
already been adopted. This section of the volume, roughly the first third, 
is at once the most original and the most stimulating, and well worth the 
close attention of anyone concerned with the analysis of migration trends. 

There follow five chapters on the countries of future residence, with par- 
ticularly detailed discussions of Australia and Canada; three chapters on 
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immigration to Britain; and two summary chapters, one on policy and one 
on statistics. 

The discussion within this framework offers much that is of broader 
significance. The bock begins, for example, with an attempt to distinguish in 
general terms between permanent and quasi-permanent, between recurrent 
and non-recurrent, and between visible and invisible migration. These are 
terms that might well become standard. The propensity to emigrate, to take 
another example, varied widely from one region of the United Kingdom to 
another, but it was not associated especially with regional differences in 
either occupational structure or population growth (p. 76). These negative 
findings challenge two widely accepted hypotheses on why people migrate, 
and Isaac discusses briefly the possibility of supplementing such data with 
public opinion polls (pp. 248 ff.). 

According to Isaac’s estimate, while some 1,084,000 persons emigrated 
from the United Kingdom during 1946-1950, net emigration amounted to 
only 140,000, or substantially less than an insignificant one per thousand 
population per year. Even so, the movement has been important; Britain 
has lost skilled laborers and technicians, clerical workers, businessmen and 
professionals; and has gained an almost equal number of unskilled and semi- 
skilled workers. In demographic terms, there has also been a loss, especially 
by the emigration of about 100,000 war brides and their 50,000 children. On 
the other hand, contrary to what one might expect, old-age dependents 
were about three times more frequent among emigrants than among British 
population (p. 212). 

These migration trends have been the consequence, at least in part, of 
deliberate policy. The British government has encouraged and financed 
both emigration to the Dominions and immigration of aliens. One might 
ask whether this has been a reasonable process, or whether “Britain in the 
name of Commonwealth solidarity and strength has been making an un- 
necessarily heavy sacrifice of badly needed skilled workers” (Economist, 
August 28, 1954). Isaac is inclined to accept at face value the claim that 
subsidized emigration strengthens Commonwealth bonds, although it is at 
least possible that Britain’s “half-hearted and inconsistent” emigration 
policy (p. 242) has had, on balance, the contrary effect. For Britain’s inter- 
est has been essentially opposed to that of the Dominions: “without the 
operation of the agreements the number of emigrants leaving essential 
occupations would have been substantially larger” (p. 243). Isaac treads 
softly through this thorny area, and his discussion does not quite face up to 
the paradox of a government paying well for the loss of needed skills. More 
fundamentally, it seems to me that he has not yet resolved a dilemma evident 
in his earlier Economics of Migration. On the one hand, he patently favors 
free migration, both for its humanitarian value and for its relation to eco- 
nomic liberalism; on the other hand, he welcomes the state control of migra- 
tion even though this is typically set by nationalist, not to say mercantilist, 
motives. 
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Apart from such highly debatable questions, this is an excellent monograph, 
While narrower in ostensible subject matter than the Economics of Migra- 
tion, it should not be restricted to those especially interested in postwar 
British migration. Demographers generally have come to expect a high level 
and a wide range of genuine competence from Isaac, and they will not be 
disappointed in this work. 


Year Book of Labour Statistics, 1954. International Labour Office, Geneva, 1954. 
Pp. xv, 397. $5.00. 


Jack Currnicx, Rutgers University 


N THIS volume the I.L.O. continues its useful compilation of statistics on 

labor force composition and the economic position of the working popu- 
lation in countries all over the world. The data, derived from communica- 
tions to the Office or from official publications of the countries covered, 
are presented in eleven chapters and an appendix. The scope of the volume 
is indicated by the chapter headings: Total and economically active popu- 
lation; Employment; Unemployment; Hours of work; Wages and labour 
income; Consumer price indices and retail prices; Family living studies; 
Social security; Industrial injuries; Industrial disputes; and Migration. 
Each chapter i. introduced by brief notes (in English, French and Spanish) 
explaining the tables included. But the textual material includes no dis- 
cussion of the original statistical compilations in the countries covered. For 
such discussion of definitions, methods of collection and classifications of 
data, the reader is referred to other publications of the I.L.0. and the United 
Nations. 

Presumably, the collection in one volume of statistics on a world-wide 
basis is designed to serve two purposes: first, to make available in a central 
source the statistics on employment and labor conditions in a large number 
of countries; second, to permit international comparisons. The first purpose 
is quite adequately achieved in the Year Book, even though, in some cases, 
interest in a specific country or area of the world would, of necessity, have to 
be pursued beyond the data reproduced here. Useful aids to such research 
are included in this volume in the form of an index of countries included in 
each table (pp. 394-397) and a list of publications constituting principal 
sources of current statistics for the several countries. 

Unfortunately, the second purpose often is not realized. This, of course, is 
in no sense the fault of the compilers, but is due to the inherent lack of com- 
parability in many of the statistics prepared by national governments; there 
are wide differences in definition, techniques of collection, coverage, etc. 
However, the authors have managed, in some cases, to arrange the data so 
as to permit reasonable comparisons in certain naztow categories even where 
aggregate comparisons are not possible. They have, moreover, in the pref- 
atory notes to each table, done a useful service in cautioning against drawing 
international comparisons whenever these are rendered invalid by the origi- 
nal character of the data. 
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It should be added that East-West comparisons are not possible for a more 
cogent reason than lack of uniformity in the statistical base; of the com- 
munist countries, figures are shown only for Czechoslovakia and Poland and 
these appear in only a scattered few of the tables. 


Employment and Wages in the United States. W. A. Woytinsky and Associates. 
New York: Twentieth Century Fund. 1953. Pp. xxxii, 777. $7.50. 


N. Arvoup Touues, Cornell University 


HIS is the most monumental work on labor problems to appear since 

Millis and Montgomery’s three-volume Economics of Labor (1938, 1943). 
In spite of its more specialized title, Employment and Wages in the United 
States covers almost as broad an area as the earlier work. Moreover, the 
magazine format permits Woytinsky to provide almost as much printed 
matter in a single volume as did Millis and Montgomery in their three 
volumes. 

Millis steadfastly insisted that Economics of Labor was “not a textbook.” 
The present volume has the characteristics of an encyclopedia to an even 
greater degree. Yet instructors have already found, in this case as in the 
earlier one, that selections from such sources often provide more effective 
teaching materials than do the standard textbooks. There are disadvantages, 
to be sure. Students will be reluctant to purchase or to carry a volume of this 
size. More important, the very definitiveness of the materials is apt to be 
stultifying, unless the instructor points up the issues, propounds challenging 
questions, and bridges the gap between the students’ minds and the book. 
Woytinsky’s work is written in a much more lucid style, but it is less pro- 
vocative, than was Millis and Montgomery’s. 

As a reference, the present volume is superb, once the teacher or research 
worker has defined his own problem. The great advance in American labor 
statistics during the past 20 years is nicely demonstrated, especially in the 
areas of labor force, workers’ earnings in relation to national income and 
wage structures. Millis and Montgomery (and their colleague, Paul H. 
Douglas) were adept in squeezing the utmost meaning out of the quanti- 
tative materials of their time. Woytinsky and his associates have mined with 
great skill the much richer ores of the present day, especially those of the 
Social Security Board, the Bureau of Labor Statistics and the Census Bu- 
reau. Both within the articles and in numerous appendixes, these authors 
have provided large masses of refined statistics which will serve as a starting 
point of original contributions for many years to come. 

The selection of wages and employment as the fulcrum for the analysis 
of industrial and labor relations will seem old-fashioned to many persons, 
but this approach is heartily welcomed by the present reviewer. Indeed, 
his own complaint is that this focus is not kept in view with sufficient single- 
mindedness. In deference to current tastes, almost one-third of the work is 
concerned with “The Institutional Setting” (Part II). Very valuable in- 
formation is presented therein, which helps to make the book an encyclo- 
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pedia, but the relevance of this material to wages and employment is never 
made clear. 

Woytinsky has evidently sought to glean the most from a combination 
of theory, institutional history, and statistics. Personal evaluations of the 
result will be based on the judgment of each scholar as to which of these 
roads to understanding he considers to be most crucial. Admiration has been 
expressed for the handling of the statistics. The institutional history, how- 
ever important by itself, appears to this reviewer to be in the nature of 
“filler” in a work of this kind. 

As for theory, one notices at once the initial effort to relate the whole 
work to a theoretical framework. The very first chapter usefully recapitulates 
the nineteenth century wage theories, in a way that has a greater breadth but 
less penetration than the notable Volume I, Chapter IV of Millis and Mont- 
gomery. As for Woytinsky’s first chapter, one is most puzzled by the con- 
cluding assertion that the “great names—from Adam Smith and Nassau 
Senior to Marshall and Pigou—can be called as witnesses in support of... . 
four principles,” including the principle that “labor unions... are the 
mainspring that moves the mechanism of supply and demand in wage 
determination.” The classical economists were certainly reformers but they 
were not to that extent convinced of the central role of labor unions. Indeed, 
much of the factual analysis in this book seems to put labor union efforts in 
their appropriate place, rather than to make them appear as the “main- 
spring” of the economy. 

On its theoretical side, one misses any statement of a systematic theory of 
employment. While Part I (Wages) is introduced by an essay on theory, 
Part III (Employment and Unemployment) has no such introduction. It 
would seem that neither the influence of Keynes and his followers nor that of 
his critics and opponents have penetrated very deeply into the thinking of 
American labor economists. 

The intended climax to this volume is the text of the Report of the 
Committee on Wages and Employment of the sponsoring Twentieth Cen- 
tury Fund. This harmless but ineffectual little sermon exhorts the parties to 
collective bargaining to be tolerant of each other and especially exhorts labor 
unions to moderate their wage demands so as to avoid the danger of inflation. 
If anyone wishes to know the kind of compromise statement which could be 
agreed to in the year 1953 by a cross-section of union, management, and 
public representatives, the Report may be an interesting historical document. 
At least, the inclusion of the Report is less annoying than that of a sponsor’s 
TV “commercial.” However, there is little evidence that the thinking of the 
Committee was influenced by the scholarly efforts of Woytinsky and his 
associates. 

Some mention should be made of these “associates.” There were sixteen of 
them. Their names are buried in one of the four title pages, and in the appro- 
priate chapter-title footnotes, but they are not displayed in the otherwise 
excellent table of contents. While Woytinsky clearly organized and inte- 
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grated the whole work anc indeed wrote about half of all the forty-four 
essays (averaging 12 large pages per essay), the “associates” would seem to 
deserve greater prominence. Like Woytinsky himself, all of the associates 
are, or recently have been, Washington civil servants and they deserve a 
recognition which our society seldom gives to such important contributors 
to our basic knowledge. The least this reviewer can do is to list their sur- 
names, a8 follows: Arnow, Ball, Bowden, de Schweinitz, Douty, Ducoff, 
Evans, Grunfel, Huber, Kovner, Livernash, Ober, Pancoast, Sanders, 


Siegel, and Stocking. 


The Influence of Plant Size on Industrial Relations. Sherrill Cleland. Princeton: 
Princeton University, 1955. Pp. 65. $2.00. Paper. 


OswaLp Hatt, McGill University 


7 is a study of the relations between plant size and industrial relations 
in eighty-two plants in Trenton, New Jersey. Plant size is measured in 
terms of numbers of workers, and industrial relations mainly in terms of the 
degree of conflict between workers and management. The study bristles with 
the difficulties involved in trying to secure satisfactory numerical values for 
such things as income of workers, number of conflict situations, and so on. 
The actual correlations between plant size and pattern of industrial relations 
are not of a very high order. 

The main variation between plants seems to lie between those which have 
retained fairly intimate, personal relations between managers and workers 
and those in which professional public relations personnel and specialized 
union functionaries stand between the two. These newly emerging semi-pro- 
fessionals, brought in to offset some of the effects of increas ag size of plant, 
frequently bring a host of new problems in their wake. Hence it is not 
only plant size which influences industrial relations, but also the sorts of 
administrative machinery which have been set up to offset the disadvantages 


of size. 


An International Comparison of National Products and the Purchasing Power of 
Currencies. A Study of the United States, the United Kingdom, France, Ger- 
many and Italy. Milton Gilbert and Irving B. Kravis. Paris: Organization for 
European Economic Co-operation, 1954. Pp. 203. $3.00. 


Joun H. Apter, International Bank for Reconstruction and Development 


bie is an ambitious statistical study which attempts to do just what its 
title indicates: to compare the national products of the United States, 
the United Kingdom, France, Germany, and Italy, and the purchasing 
power of their currencies. The attempt has been eminently successful, the 
conceptual and statistical limitations of the undertaking notwithstanding. 

The objective of the study is the revaluation on # uniform price basis of 
the goods and services produced in 1950 (and, to some extent, in 1952), 
in each of the five countries. This required the gathering and sorting of in- 
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formation on the quantities of the various goods and services produced, and 
presented a pricing problem compiicated by quality differences from country 
to country of the same kinds of goods. 

From this exercise there can be derived five sets of figures showing the 
value of national products of the five countries at U.K., U.S., French, Ger- 
man, and Italian prices, and the purchasing power equivalents of the five 
national currencies. Actually, what the authors have done is to value the 
European national products at U.S. prices (and, to value the American 
GNP at the respective European prices), to compare the GNP’s of the four 
European countries and the U.S. at an average European price level, and 
to compute purchasing power equivalents on the same bases. To obtain 
meaningful and comparable quantity and price data required the rearrang- 
ing of the national accounts on a uniform—or nearly uniform—basis and a 
breakdown of the components of GNP into numerous subdivisions. (The 
main tables show a breakdown under 39 headings, but the text and subsidiary 
tables indicate, in many instances, a further breakdown.) 

The results are striking, but by and large, in accordance with expecta- 
tions, based on casual and piecemeal observations. If the national products 
are revalued on a uniform basis, whether at U.S. or European prices, the 
differences in per capita GNP between the U.S. and the European countries 
are very much less than if national product estimates are converted at pre- 
vailing exchange rates. To give but one example: average per capita con- 
sumption of the four European countries comes to U.S. $369, if valued at 
prevailing exchange rates, and to $615, if revalued at U.S. prices, compared 
with U.S. per capita consumption of $1,259. There are, of course, wide varia- 
tions among the European countries: per capita GNP, at U.S. prices, is 
$829 for the U.K., $663 for France, $523 for Germany, and $388 for Italy. 
In every instance, the conversion of national values at prevailing exchange 
rates results in a much lower figure. 

The detailed breakdown of the various ‘components of GNP, the computa- 
tion of purchasing power equivalents, and a series of other tabulations of 
results throws new light on a large number of old questions. For instance, it 
is most illuminating (and amusing) to see on what the average American 
consumer spends the $430 at his disposal over and above the $829 available 
to his British cousin. The American consumer eats a bit more (48 dollars 
worth) and, on the whole, better food: less cereals, potatoes, and fats and 
oils, but much more fruits, vegetables, and dairy products. He has a few 
extra drinks ($3), and he smokes a great deal more ($5). But the big differ- 
ences come in five expenditure groups: purchase of household goods (furni- 
ture, equipment and appliances, household and personal supplies) on which 
the American consumer spends $82 more than the British consumer or nearly 
three times as much; purchase of transportation equipment (cars, motor- 
cycles and bicycles) which account for $61 ef the American consumer’s 
expenditure, against $3 for that of the British; operation of transportation 
equipment—$53 in the U.S., as against $5 in the U.K.; purchase of clothing 
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and household textiles—$139 in the U.S., only $73 in the U.K.; and “miscel- 
laneous” expenditures which consist partly of tourist expenditures abroad 
($81 compared to $19). There are several other differences, but it is quite 
clear that about 60 per cent of the difference between the U.S. and the U.K. 
level of living is accounted for by the prevalence of consumer durables and 
semi-durables in the American consumer expenditure pattern. The differ- 
ences become, of course, much more pronounced in comparison with the 
continental countries. 

A counterpart of these differences in the expenditure pattern is to be found 
in the pattern of purchasing power equivalents. The purchasing power of 
European currencies is highest for commodities which the European con- 
sumer buys in larger, or in not much smaller, quantities than the American 
consumer; it is lowest for those items of which the former buys little and the 
latter much: cars, gasoline, household goods, clothing, etc. 

These findings are by and large in accordance with a priori expectation. 
There is, however, one price differential which is more difficult to account 
for and which may be of great significance for the long-term growth pros- 
pects of the European countries compared with those of the U.S. “On the 
United States pattern of expenditures, purchasing power in European coun- 
tries is about the same for both consumption and investiment, while on the 
European pattern of expenditures, investiment, except in Germany, is more 
expansive than consumption. In other words, to the extent that relative prices 
affect the allocation of resources, the European price structure is more favor- 
able to consumption and less favorable to investment than is the case in the United 
States” (p. 33—italics mine). 

As the quotation and what has been said above about the “content” of the 
difference in the level of living between the U.S. and the U.K. show, the 
study contains a wealth of essentially new information which deserves to 
be analyzed to bring out its broader implications. Apart from a few isolated 
remarks, the authors have confined themselves—probably rightly so—to the 
presentation of statistical results. It is to be hoped that they themselves and 
others will follow up the leads of the study. 

About half the book is taken up with a detailed account of the methods 
by which the quantity and price data were gathered—the authors received 
what must have been most valuable help from the Organization for European 
Economic Cooperation and the Economic Cooperation Administration- 
Mutual Security Administration staffs--2znd by which the numerous prob- 
lems of comparability, inadequacy of information, etc., were met. There are 
several paragraphs dealing lucidly with conceptual difficulties (e.g., the pric- 
ing of government services, the handling of quality differences), and, on 
occasion, the authors take issue with other writers who have discussed prob- 
lems of international comparability of national accounts, or, of national data 
generally. The account of the techniques used in preparing the data, and the 
comments on the broader issues of comparability, valuation, definition of 
end-products, etc., are refreshingly common-sense, and must impress anyone 
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who in his own work has faced similar problems, and has had to make similar 
value judgments. Welfare economists, both orthodox and sectarian, are 
bound to have misgivings about, and to take exception to, the free and easy 
way in which the authors have gone about tackling their problems and find- 
ing solutions where the welfare iconoclasts would deny the possibility of g 
solution. But those who prefer approximate common-sense answers to their 
common-sense questions will be grateful to the authors, and hope that the 
Organization for European Economic Cooperation will continue this work. 


Individuals’ Saving: Volume and Composition. Irwin Friend with the assistance 
of Vito Natrella. New York: John Wiley & Sons, Inc., 1954. Pp. x, 288. $5.00. 


Epwakp F. Denison, Department of Commerce 


_ is a most useful book which anyone concerned with interpretation of 
saving statistics will wish to read and to have available for frequent ref- 
erence. Lucid and concise writing enables the authors to present in limited 
space nearly everything of importance that is known about individual saving 
in this country since the twenties and, of at least equal significance, to in- 
dicate what is not known, and why. 

The range of subject matter is broad. It includes the meaning of personal 
saving and its conceptual relationship to business and government saving; 
description and appraisal of aggregate personal saving series derived by the 
three available methods—disposable income less consumption, investment 
less non-personal saving, and changes in individuals’ assets and liabilities— 
and of saving and related statistics which have been obtained by surveys of 
households; and an adjustment of the results of several of the household 
surveys to place them on a conceptual basis comparable to the aggregate 
data. The household survey results are found to depart widely from the 
aggregate series, which are in fairly good agreement among themselves. 
Trends in both aggregate individual saving and its components (in terms of 
asset and liability changes) are examined, with some consideration of long- 
term trends but greater stress on more recent, and particularly postwar, 
experience. There is a careful analysis of both household survey results and 
time series data to ascertain what relationships can be shown to exist be- 
tween saving rates and characteristics of the population and other economic 
variables; included is a critical analysis of deductions drawn by other in- 
vestigators. Consideration of a variety of historical statistical relationships 
between saving and disposable income and other variables, their a priori 
reasonableness, and their effectiveness in forecasting postwar saving, con- 
cludes the main part of the study. 

The last 123 pages of the volume consist of a greatly needed detailed de- 
scription, not previously available, of the SEC data on individuals’ saving, 
including a great deal of worksheet detail underlying the published estimates. 

In the present state of knowledge, interpretation of saving statistics in- 
evitably requires exercise of a great deal of judgment and understanding, 
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and even of intuition. The authors of this volume, from their long experience 
with the preparation and analysis of government data on saving, are unusual- 
ly qualified to express worthwhile opinions on what reasonably can and 
cannot be deduced from the available information. While exercising proper 
restraint and, for the most nart, providing the reader the necessary underly- 
ing evidence, they fortunately have not hesitated to express such views. 
With most of their judgments the reviewer would concur. 

One which seems questionable is that the aggregate saving estimate de- 
rived by measurement of changes in individuals’ assets and liabilities is 
probably more rejiable than the other two aggregate series. Most compo- 
nents of this estimate are obtained as a double residual. Individuals’ assets 
or liabilities at the beginning and end of a period are estimated by deducting 
estimates of the assets or liabilities of governments, corporations, and for- 
eigners from those for the economy as a whole; the change in individuals’ 
assets or liabilities during the period is then obtained as the difference be- 
tween the amounts outstanding at the beginning and end of the period. 
Other components, for which the change is estimated directly, include items 
for which data are particularly tenuous. It is by no means clear why this 
series should be preferred to the other two which, though also residuals, are 
derived from much smaller gross figures. 

The volume is unusually free of obscure, slippery, or contradictory pas- 
sages. One exception should be mentioned, however. On p. 99 it is said that on 
the basis of Kuznets’ data “it cannot be stated with any confidence that there 
was or was not a moderate upward or downward trend [since the eighties] 
in the ratio of individuals’ saving to income.” On p. 100 it is concluded from 
“aggregate and survey results” that “there does not appear to have been 
any substantial upward or downward trend in the ratio of individuals’ 
saving to income.” Unless stress is placed on the difference between “mod- 
erate” and “substantial”—neither term is quantified—this is a very differ- 
ent statement. The short intervening discussion of survey results, it may be 
noted, lends little support to the second conclusion, and particularly so in 
view of the authors’ low opinion of the quality of the survey data. 

That appraisal of consumer survey results, and an equally dismal view of 
the quality of nearly all pre-1929 saving statistics, may also call into ques- 
tion the authors’ use of survey data to ascertain saving patterns and of 
individual saving aggregates for the twenties in the derivation of relation- 
ships of saving to income and other variables. They are utilized cautiously, 
however, and presumably only in ways the authors consider warranted. 

The most general question the reviewer would raise about the discussion 
is whether the behavior of individual saving can satisfactorily be analyzed 
in isolation from that of business saving. As indicated in my article in the 
January 1955 Survey of Current Business, the record from 1929 to 1954, 
at least, suggests the presence of a more stable relationship between total 
gross private saving and total income or product than can be established 
separately between personal saving, corporate saving, and capital consump- 
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tion allowances, on the one hand, and relevant income series on the other, 
This at least raises a question as to whether changes in individual saving can 
be adequately analyzed without taking account of offsetting shifts in the 
other components of total saving. It may be noted also that it is the total 
saving relationship which is required for business cycle analysis, the parts 
being of less interest unless they help in appraising the behavior of total 
saving. It is no criticism of a study, however, to note that it is confined to its 
stated scope, and Friend and Natrella have rendered a valuable service in 
covering individual saving so adequately. 


A Century and a Half of Federal Expenditures. M. Slade Kendrick, assisted by 
Mark Wehle. New York: National Bureau of Economic Research, Inc., 1955. 
Occasional Paper 48. Pp. x, 102. $1.25. Paper. 


Henry C, Murpuy, International Monetary Fund 


H18 little booklet is an excellent reference work on Federal expenditures 

from 1794 to 1952. Kendrick and Wehle deserve the thanks of all persons 
interested in these statistics for their laborious and painstaking adjustment 
and classification of the underlying data, which are taken from a great 
variety of sources, and for the convenient form in which they have presented 
their results. 

The booklet contains annual figures of total federal expenditures for the 
whole period expressed (1) in current dollars; (2) in dollars of constant (1926) 
purchasing power; (3) in 1926 dollars per capita; and (4) as a percentage 
of gross national product (before 1870 of national income at factor cost). In 
addition, the data are broken down into “military,” “veterans,” “interest,” 
and “civil” (i.e., residual) expenditures for the whole period. Beginning in 
1915, the additional category of “foreign” expenditures is added. The authors 
have done their best to make these classifications reasonable. In general, 
they have done a very good job. The only important exception which occurs 
to the reviewer is that, in his opinion, it would have been more realistic to 
have included the expenditures on the development of atomic energy under 
the head of “military” rather than, as they have in iact done, under the head 
of “civil.” 

The story of the great increase in federal expendicures over the period as 
a whole and of the tendency for these expenditures—both military and civil— 
to rise in steps following each great war (as compared with the prewar 
period) is, of course, a familiar one. It is natural, therefore, that the re- 
viewer's attention should turn to special problems in the statistics. The most 
notable of these applies to the data on total Federal expenditures as a per- 
centage of national income during the first half of the nineteenth century. 
These percentages are based on the underlying estimates in Martin’s 
National Income in the United States, 1799-1938 (National Industrial Con- 
ference Board, 1939) although, according to Simon Kuznets, cited by the 
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authors of this study, Martin’s estimates have a downward growth bias 
and imply a decline in per capita real income between 1800 and 1840. It 
seems to the reviewer that such estimates provide shaky denominators for 
the ratios used in this study and that this matter should be thoroughly 
reinvestigated. 

In discussing the growth in military expenditures, the authors present 
a wealth of data showing that both the proportion of the eligible male popu- 
lation in the armed forces and the expenditures per capita of the armed forces 
have increased substantially over the whole period—although the latter have 
increased much more than the former. In connection with the former, it 
seems to the reviewer that the age bracket 20-39 years, which is used as that 
most representative of the universe of all males eligible for the armed forces, 
is too high. It would scarcely stand the test of the Marine Corps obstacle 
course at Parris Island. In connection with the latter, the authors have in- 
cluded an excellent appendix on “Technological Advances in the Weapons 
and Equipment of the Armed Forces.” This covers within a brief compass the 
major developments in this field over the entire period and the reviewer 


found it fascinating. 


Big Enterprise ia a Competitive System. A. D. H. Kaplan, with a preface by 
Robert D. Calkins. Washington: The Brookings Institution, 1954. Pp. xii, 269. 


$4.00. Paper. 
Grorce H. Hizpesranp, University of California, Los Angeles 


|" paar generation seeks to overthrow the beliefs of its predecessor. In doing 
so it erects 2 new orthodoxy. Intellectual history thus exhibits an endless 
dialectic of interpretation and reinterpretation of human experience. 

Currently, we are in the midst of a revisionist: movement concerning the 
role of big business in the American system. Equally sincere observers in 
the ’thirties saw bigness as a major cause of inefficiency, rigidity, and secular 
stagnation. Today the votaries of revisionism in this field view the large 
firm as the mainspring of economic progress, fostering new dimensions to 
competition, accelerating the rate of innovation, and contributing to an un- 
rivalled advance in the standard of living. That this profession of faith is 
but a repetition of the forgotten gospel of the New Era stays these prophets 
not, though the recollection may be embarrassing to some of them. 

I do not think I do him an injustice when I include Kaplan in the revi- 
sionist school. As he says himself at the end of his book, “Big business has 
not merely been kept effectively subject to a competitive system; on the 
whole it has made an essential contribution to its scope, vitality, and ef- 
fectiveness” (p. 248). His argument is reasonable, tempered by concessions 
to skeptics, and replete with many facts. How, then, did he arrive at this 
conclusion? 

Kaplan poses his problem in two parts: (1) What is the relationship of 
big business to the changing structure of the economy; and, (2) How are we 
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to interpret its performance? He attacks the first question largely as a statis. 
tical problem, relying heavily upon already well-worked materials, aug. 
mented by a new survival test of the 100 largest industrials, 1909-1948, 
Concerning performance, Kaplan depends upon analytical assertions, illus. 
trated here and there by fragmentary evidence. What emerges is an implicit 
and relatively untested, not to say optimistic, theory of big business, more 
likely to impress the already convinced than to convert the unconvinced. 

Regarding big business and the economic structure, Kaplan looks first 
at the place of large firms in the tota] business population, using size distri- 
butions based upon number of employees. Here he finds that “the giant firm 
has grown with the economy but has had no measurable effect on the dis- 
tribution of total economic activity as measured by employment” (p. 72). 
Next he turns to concentration of industrial production. Within the twelve 
largest manufacturing industries, plant »:ze shows remarkable variance, 
with technology the principal influence. This holds true also for company 
concentrations by industry. Neither measure yields a definitive index of 
market power, Kaplan believes, because “industry” categories oversimplify, 
freeze, and distort actual competitive relations. Nor do measures of con- 
centration by product turn out much better, partly for the same reason and 
partly because structural cross-sections conceal competition by innovation. 

So Kaplan contends that concentration data must be supplemented by 
case studies—surely a proper inference. Less persuasive, perhaps, is his own 
foray into this field, particularly his résumé of the aluminum story, where he 
concludes that “ .. . in the final analysis the price has been determined com- 
petitively in the market place” (p. 99). The only basis for this assertion is 
cross-product competition with other metals. If so, one may ask, as George 
Stocking does, whether market behavior was in any way modified by govern- 
ment-sponsored entry of two additional primary producers and by court 
decision in 1945? 

Kaplan next considers the relative financial position of the large corpora- 
tion—not because such measures are to him a reliable index of market power, 
but because financial scale commonly is so viewed and indeed often con- 
sidered cumulative. Yet, whatever the measure, he finds no agglomerative 
trend favoring the large firm—by share of total corporate assets, corporate 
profits, or national income. Kaplan makes no attempt to examine the ques- 
tion of corporate interlocks beyond a later assertion that banker control 
has given way to professional management. 

He breaks new ground with his test of turnover among the 100 largest 
industrials, 1909 to 1948. Of the original 100, 36 remained on the list at the 
end of the period. There were 47 newcomers in 1919, 28 in 1929, 11 in 1939, 
and 20 in 1948. So Kaplan concludes that turnover is high among the 
mighty—the top is a greasy pole, to borrow from Disraeli. Or in Kaplan’s 
words, “There is no reason to believe that those now at the top can remain 
there any more than did their predecessors, short of alert participation in 
continuous product and market development” (p. 142). Perhaps so, but be- 
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fore we accept this conclusion it would be of interest also to know what role 
mergers played in the creation of these new leaders. 

Regarding the place of large firms in the economic structure, Kaplan thus 
finds no causes for concern, either at the present time or on the basis of past 
history. He devotes the remainder of his book to an assessment of the per- 
formance of big business. 

Here we find an ably written interpretation of the place of size in a com- 
petitive system, along the now familiar revisionist lines. Prices, for good 
reason, may be somewhat less flexible in the short run, but quality variation 
and innovation generally prevent realization of the gloomy theoretical vision 
of static oligopoly with shared markets. Where the struggle to attain and to 
hold positions is seemingly endless, “live and let live” is presumably a poor 
philosophy. In the large concern compe<ition acquires a multiform aspect, 
reflecting internal pressures and rivalries, fickle and insatiable consumers, 
and a never-ending series of innovations with their enforced adaptations. 
Integration in all of its forms may lead to increased efficiency rather than to 
monopoly power, as at the turn of the century. 

Kaplan concedes that difficult problems for public policy will still emerge, 
in certain types of discrimination, in “undue” leverage upon buyers and 
sellers. However, there is little that is new in his account. While calling for 
a “flexible policy” guided by a conception of reasonable limits in particular 
cases, he provides no concrete tests of what is unreasonable as to size, mer- 
gers, or market practices. Integration as such, presents to him no serious 
general problems, and “ . . . has undoubtedly proved its capacity to promote 
expansion of the entire economy, including additional opportunities for 
smaller businesses and new users” (p. 230). By contrast, and here surely he is 
correct, legislative attempts to enforce uniform prices have injured the com- 
mon good. 

The reassessment of the revisionists was no doubt overdue, but there is 
some danger that the reaction may go too far, though I do not intend this as 
a criticism of Kaplan in particular. Plainly the classical model of competition 
has serious shortcomings as a guide for interpreting behavior in a rapidly 
changing economy or for setting policy in that economy. Nonetheless, a 
satisfactory new theory has yet to be developed, if indeed one is possible. 

As the foremost member of the new school, Schumpeter was probably right 
in suggesting that the rise of the large firm and the impressive advance in 
output were more than a chance coincidence. Yet before we proceed to 
relax comfortably on yet another “inevitable historical trend” we might well 
recall that large enterprise can also lead to stagnation, as much European 
experience indicates. In general the American story has been otherwise, 
given the differences in our cultura! tradition and in the environment in 
which that tradition has happily found expression. Yet we have some record 
of anti-competitive behavior here in the United States. Vigilance, rather 


than complacency, is the price of safety. 
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Capital and Output Trends in Manufacturing Industries, 1880~1948. Occasional 
Paper 41. Daniel Creamer. New York: National Bureau of Economic Research 
Inc., 1954. Pp. 104, $1.50. 


Kenneta A, Bour, International Bank for Reconstruction & Development 


— paper is one of a series describing research being conducted by the 
National Bureau on long-term trends in capital formation in the major 
capital-using sectors of the U.S. economy. It presents concisely yet with 
ample documentation the major findings for manufacturing industries. In 
a long introduction Simon Kuznets places the paper in the perspective of 
the larger program of research and speculates briefly on some possible causes 
for the observed trends. 

The analysis is centered around the behavior of the ratio of capital stock 
to output (the capital-output ratio) for a series of benchmark years (1880, 
1890, 1909, 1919, 1929, 1937 and 1948) seiected on the basis of availability 
of data and similar relative positions in the economic fluctuations of the 
period. Data on capital stock were obtained fium Censuses of Manufactures 
for the period 1880-1919 and from corporate income tax returns for 1929 
and thereafter. Measures of output were obtained from the same sources. 
The principal finding is that a significant rise in the capital-output ratio for 
manufacturing industry as a whole took place from 1880 to the decade 1909- 
1919 followed by a definite and substantial decline to 1948, the most recent 
date studied. In other words although capital stock increased throughout 
the period its rate of growth exceeded that of output up to around 1919 and 
was exceeded by output’s rate since then, at least until 1948. This observed 
trend was tested in various ways. It was found that it was not affected by 
varying the concepts of capital and output used. It held when either total 
capital, fixed capital or working capital were used and it held when output 
was taken gross or net. It was also found that neither the changing structure 
of industry nor the changing size of firms offered adequate explanations of the 
trend. 

In addition to the main findings on manufacturing industry as a whose 
mention should be made of the interesting material presented on the behavior 
of various groups of industries—material collected in order to investigate 
the change in industrial structure over the period and the effect of this 
change on the over-all ratio. It was found possible to distinguish 41 minor 
industries classified into 15 major groups over the entire period. The dis- 
tribution of total capital stock was computed in terms of these 41 industries 
for each benchmark year and capital-output ratios were determined for each 
industry for each year. The detailed picture this data presents of the chang- 
ing structure of industrial capital, changing capital-output ratios over time, 
and of the marked differences in the levels of the ratios among industries 
represents a considerable amount of new knowledge that should certainly 
prove most useful to students of industrial growth. 

No attempt is made to explain the findings. However, various suggestions 
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of possible explanations are made by Creamer, and by Kuznets in his in- 
troduction. Both mention that the spread of technological innovation of a 
“cgpital-using” type may help explain the early rise in the capital-output 
ratio and Creamer also suggests that extensive industrial development as 
shown in the rapid growth in the number of establishments during this period 
may have also been a factor. To account for the later decline in the ratio 
technological change is also mentioned as well as the usually assumed drive 
of the entrepreneur to use scarce resources more efficiently. And on a more 
specific level Kuznets suggests that with a decline in the rate of growth of 
gross fixed capital formation the per cent of current capital stock renewed 
tends to grow and the renewed stock, generally being more productive than 
that which it replaces though costing no more in constant price terms, tends 
to depress the ratio. 

It is of course a long way from a suggested hypothesis to a tested explana- 
tion. However, a forthcoming monograph which promises to deal with the 
same subject matter in more detail may be expected to make the task of 
explanation somewhat easier. So this excellent study leaves one stimulated, 
informed and eagerly awaiting the promised sequel. One might add that in 
these times, when economists are increasingly preoccupied with problems of 
economic growth and when the concept of the capital-output ratio is paying 
for its current popularity with a certain amount of indiscriminate use, it is 
particularly satisfying to see a competent study such as this supply a much 
needed perspective to the concept, at least insofar as it refers to one impor- 
tant economic sector. 


Canada and the International Business Cycle, 1927-39. Edward Marcus. New 
York: Bookman Associates, 1954. Pp. 211. $3.75. 


R. Craia MclIvorn, McMaster University 


tT case study was designed te narrow a serious theoretical and empirical 
gap in the literature of the business cycle, i.e., the problem of how cyclical 
influences are transmitted internationally. Specifically, it is concerned with 
the interrelationships among Canada’s domestic activity, foreign trade, and 
international capital movements, as they have reacted and adjusted to forces 
originating abroad—whether in response to the business cycle of an impor- 
tant customer (e.g., the United States or the United Kingdom), or the 
workings of international competitive forces. 

The first two chapters provide a summary economic background and an 
analytical framework from which the detailed study of cyclical forces is 
subsequently developed. Thus, such features of the 1927-29 Canadian econ- 
omy as its primarily extractive basis, its relatively great dependence on 
foreign trade and the unique geographic distribution of that trade, vis-A-vis 
the United States and the United Kingdom, are noted. Also mentioned are 
the lack of a central bank and of discretionary monetary control before 
1935, the de facto departure from gold at the end of 1928 and subsequent 
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exchange fluctuations, and finally, the shift in Canada’s international posi- 
tion from a net capital importer to exporter during this period. 

It is asserted that the Canadian business cycle is subject to two groups of 
external influences, i.e., forces of a general economic nature and those of 
cyclical nature. The former, which include such variables as changing tastes, 
customs, trade barriers and competitive pressures, are not produced by 
any cyclical movement abroad, while the latter are directly related to, and 
would not occur in the absence of, such external cyclical movements. It is 
the author’s basic contention that the external determinants of Canada’s 
downturns in 1929 and 1937 are to be found in the general economic forces 
just noted, but that once her economy became depressed, Canada was 
powerless to initiate a recovery, this depending upon a prior upturn in the 
United States and the United Kingdom, i.e., upon external cyclical influences, 
Summarizing this thesis, Marcus claims that “somewhat facetiously, one 
might say that Canada enjoyed enough independence to get herself into 
trouble, but that she needed outside help to emerge from the resulting de- 
pression.” 

The detailed analysis of the Canadian fluctuations is divided into four 
periods: 1927-29 (boom and collapse), 1930-33 (recession and recovery), 
1933-37 (recovery) and 1937-38 (recession and recovery). The end of the 
Canadian boom during the late ’twenties is found to have been foreshadowed 
by sagging exports of woodpulp, newsprint, wheat, and flour in 1927 and 
1928. The adverse effects upon income “rippled” through the economy with 
cumulative force, and the downturn occurred several months before the 
comparable development in the United States. These depressing influences 
are described as “exogeneous forces of a somewhat random nature” inde- 
pendent of any cyclical influences abroad. 

As long as Canada’s two main export markets, the United Kingdom (wheat 
and flour) and the United States (woodpulp and newsprint) remained de- 
pressed, recovery was impossible, but following their respective upturns in 
1932 and 1933, the Canadian economy emerged from its period of stagnation 
to one of steady expansion. This reversal is found to have been clearly de- 
pendent upon cyclical movements abroad. In the case of the 1937 recession, 
Canada’s wheat and paper export indices faltered several months before the 
actual business downturn, “in a pattern strikingly similar to that of 1927- 
29.” Once again the explanation of this faltering runs not in terms of cyclical 
movements in the United States and the United Kingdom, but rather of a 
“fortuitous” combination of random occurrences. When World War II ar- 
rived, the Canadian economy had once again entered a period of expansion, 
for which the main stimulus was provided by rising exports related to the 
European armament program, clearly a non-cyclical influence. 

The responses of the Canadian balance of payments to external disturb- 
ances during the whole period are alleged to have differed from what might 
reasonably have been “expected” (gold standard conditions being assumed). 
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Mercus finds that during the economic contraction following 1929, imports 
a¢4val!y fell more than exports, an unfavorable balance of payments became 
favcrable, and net export of capital developed in the early ’thirties. Con- 
yersely, the boom of 1937 was accompanied by a rapid deterioration in 
Canada’s current account, and it is generally regarded as surprising that 
“the worse the state of domestic activity, the better the international posi- 
tion became.” Two brief comments may be made on this generalization. The 
first is that it was simply not the case during the upturn after 1933; the 
second is that the author, instead of rejecting the foreign trade multiplier 
as an irrelevant concept, might have incorporated it into his analysis, recog- 
nizing how its operation might be modified by “particular” circumstances. 

One of the most interesting sections of the study deals with the well-known 
limitations of fiscal policy in an open econcmy. It is not surprising that Cana- 
dian government deficits are found to have exerted little influence upon the 
course of economic affairs during the ’thirties. For the most part, these were 
“reluctant” deficits, for the philosophy of counter-cyclical financing can 
scarcely be said to have gained serious acceptance in Canada before World 
War II. 

In any such empirical investigation, it is easy to raise questions both of 
fact and of analysis and interpretation. In the former category, we have 
already noted Marcus’ implication of the negative correlation between 
national income and the current account balance after 1933; since the wheat 
“holdback” plays an important part in the analysis of the 1929 decline, it is 
not irrelevant to point out that it appears to have occurred about a year later 
than suggested. Concerning analysis and interpretation, the out-of-hand 
rejection of the foreigu trade multiplier may be seriously questioned. The re- 
viewer would also have welcomed a more explicit assessment of the relative 
importance of domestic versus external factors in Canadian cyclical fluc- 
tuations. At various points, there are unqualified statements whose validity 
is surely open to serious question. Thus (p. 179), with reference to the early 
thirties, “the use of exchange depreciation was also circumscribed, thus 
preventing the easiest means of stimulating exports.” A plausible argument 
of many years’ standing suggests that depreciation, in the particular Cana- 
dian circumstances of the time, might have had precisely the opposite effect. 
Again (p. 188), with reference to the interest rate as an instrument of eco- 
nomic control, “Canadian economists feel that the influence of the rate of 
interest is not too great.” Granted that such a statement has little meaning if 
taken literally, one may still question its general validity, in the light of re- 
cent monetary experience. 

Such criticisms should not obscure the fact that a great deal of valuable 
empirical research is contained in this detailed study of the Canadian econ- 
omy, and no student of cyclical fluctuations can fail to benefit from a careful 
reading of this monograph. 
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The Economics of Recession and Revival: An Interpretation of 1937-33, 
Kenneth Roose. Yale University Studies in Economics, Vol. 2. New Haven: Yale 
University Press, 1954. Pp. 280. $4.00. 


Bert G. Hickman, Council of Economic Advisers 


HIS is an ambitious book. Roose has set for himself the difficult task of 
assessing the relative importance of various causal factors in the contrac- 
tion of 1937-38. Among other factors he discusses monetary and fiscal 
policies, price-cost relationships, changes in the degree of monopoly power, 
underconsumption, shortage of capital, and the political and social turmoil 
of the period. The opinions of other writers, methodological issues, and the 
implications of the study for the formation of public policy also receive 
attention. Roose is primarily concerned with the causes of the downturn 
of 1937, but the upturn of 1938 is treated briefly and at various points be 
addresses himself to the question of why the entire expansion from 1933 to 
1937 was so lacking in vigor. Unfortunately, his ambitions are not fully 
realized and his major conclusions do not carry complete conviction, partly 
because of the complexity of his subject matter and partly for other reasons. 
One source of difficulty may be mentioned in passing. The presentation of 
statistical data is such that it is sometimes quite difficult for the reader to 
follow the analysis and to check the judgments of the author—an important 
drawback in a study of the quantitative-historical type. Most troublesome 
in this connection is the frequent introduction of data into the text in the 
form of values on two or more comparison dates or of percentage changes 
between such dates. Where data are not shown as continuous time series, 
it is often impossible for the reader to answer questions that come to mind 
concerning variations in rates of change during the intervals between com- 
parison dates, time-lags, and the like, and the force of the author’s argument 
is inevitably reduced. 

The study is organized around a national income framework in the sense 
that attention is focussed on the major income and expenditure flows, and 
developments in the markets for money, wages, and goods are viewed from 
the perspective of their possible influence on investment and consumption. 
This does not mean that use is made of formal aggregative cycle models; 
on the contrary, Roose is often interested in hypotheses and variables which 
cannot be fitted easily into such models, and his chapter on methodology 
stresses the uniqueness and complexity of each historical business fluctuation. 
What it does mean is that wherever possible a given causal hypothesis is 
tésted against both the observed movements of the variables directly in- 
volved and the relevance of those variables to income and expenditure flows. 
In this connection, it is worth noting that each hypothesis is tested in turn. 
' An advantage of this procedure is that it facilitates the formal testing of 
alternative theories, but it also carries an important disadvantage. It tends 
to direct attention away from possible inter-relationships between develop- 
ments in various sectors of the economy. 

Let me illustrate. In the chapter on fiscal policy, the author argues that 
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the sharp decline of “net government contribution to income” in the first 
half of 1937 was not the primary cause of the downturn on the grounds that 

rsonal income did not decline until the third quarter and that income- 
expenditure lags could not account for such a long lag. Although he points 
out that the bulk of government expenditures of that period were quickly 
received as income, he fails to note at this point that Barger’s series on ad- 
justed income and consumer expenditures decline sharply in the first quarter 
and rise only moderately in the second quarter. Later, in the chapter on 
consumption, “underconsumption” is rejected as an explanation of the down- 
turn, primarily on the grounds that “Only the quarterly marginal propen- 
sities to consume show consumption lagging, and even here it accelerated 
immediately prior to the recession” and that “consumers’ income was ap- 
parently not outdistanced by the production of consumers’ goods...” 
However, Roose’s calculations show that real adjusted income, real consump- 
tion, and consumer goods production all declined between the last quarter 
of 1936 and the second quarter of 1937, and that consumer goods production 
continued to fall even when income and consumption accelerated briefly in 
the third quarter. Still later, in the chapter on inventories, the judgment is 
reached that although inventory investment increased quite substantially 
during the first three quarters of 1937, inventories did not become burden- 
some until income and expenditures declined, end were therefore a secondary 
factor in the recession. This judgment is reached despite the fact that he 
earlier concludes that inventories were excessive relative to new orders in 
the summer of 1937 and the further fact that new orders declined substanti- 
ally throughout the first half of the year. 

It is not implausible that the various developments just described were 
causally related and were of considerable importance in precipitating the 
downturn. That is, the decline in net government contribution to income 
may have induced the decline in consumer expenditures and the latter in 
turn may have been responsible for the decline in consumer goods production 
and in new orders during the first six months of the year. Under the circum- 
stances, it is difficult to accept without reservation Roose’s judgments that 
the slackening of consumer expenditures in the first half of the year had 
little to do with the recession or that rapidiy mounting inventories were not 
considered troublesome enough to lead to efforts to reduce them until late 
in the year. Perhaps the hypothesis I suggest would not stand up under 
careful empirical investigation, but the point that I wish to stress is that it 
is not investigated at all, quite possibly because each theory of the turning 
point is discussed independently. A procedure wherein alternative hypotheses 
are tested independently and those which are judged consistent with ob- 
served behavior are retained and the others rejected may not yield the same 
set of causal factors or the same judgments of relative importance as a proce- 
dure in which hypotheses are considered as possible complements as well as 
competitors. 

Roose concludes that three major elements were responsible for the down- 
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turn. The decline in net government contribution to income shifted the 
responsibility for sustaining and increasing aggregate activity to the private 
sector. The new undistributed profits tax reduced the liquidity of corpora- 
tions and the restrictive monetary policy of the Federal Reserve raised the 
cost of capital, while both actions adversely affected business expectations, 
“Most important of all, however, was the reduced profitability of invest- 
ment, beginning in the first quarter of 1937. This resulted from the increased 
costs, in which labor costs played a prominent part.” In my opinion, a Scotch 
verdict of “not proven” should be returned on this last proposition—the data 
with which to establish either guilt or innocence are simply not available. 
As Roose himself points out, the existence of a price-labor cost squeeze 
cannot be clearly demonstrated by comparisons of changes in average hourly 
earnings and wholesale prices. Data on labor costs per unit of output, which 
take account of changes in productivity as well as in wage rates, would seem 
to be a minimal requirement for this purpose. Moreover, average hourly 
earnings did not increase sharply relative to prices of finished goods until 
the second quarter of 1937, or the very quarter in which prices of raw ma- 
terials and semi-manufactured goods began to decline, so that higher wage 
rates may have been offset at least in part by falling prices of materials. 


American Agriculture: Its Structure and Place in the Economy, A Volume in the 
Census Monograph Series. Ronald L. Mighell. New York: John Wiley & Sons, 
1955. Pp. xii, 187. $5.00. 


VARDEN Fuuuer, University of California, Berkeley 


HE Director of the Bureau of the Census and the President of the Social 

Science Research Council in the foreword to this volume explain that it 
is part of an effort to regenerate the analytical census monographs which, 
because of depression and war, have declined since the Census of 1920. 
The publisher says that it is one of a series “of interest to economists, agricul- 
tural experts, statisticians, political scientists, and other social scientists.” 
In contrast, the author’s preface states that the book “is addressed to readers 
who are interested in agriculture, but who are not necessarily professional 
agricultural specialists. Its purpose is to present in ordinary words a picture 
of the structure of agriculture. ...” Thus, it would appear that the book 
was launched and produced under divergent conceptions of purpose. 

The major misfortune of the book seems to lie in the effort ce”.currently 
to serve these essentially different purposes. Moreover, this effort is made 
the more ineffective by the discussion of questions not likely ever to be asked 
by the nonspecialist and in reference to which the specialist would im- 
mediately seek more basic sources and exhaustive treatments. Statistical 
and conceptual problems in measuring scale of farm enterprises and their 
respective efficiencies, and in defining and measuring types of farming and 
explaining their locations are prominent examples. 

The work offers very little material based on special tabulations or in- 
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tensive analyses of census data. Indeed, the Census has no outstandingly 
prominent role; regularly published works, agricultural college and Depart- 
ment of Agriculture bulletins, and estimates made by the author’s colleague- 
specialists in the Department of Agriculture share the burdens of source 
and authority. Most of the chapters deal with orthodox subjects—scale of 
farms, types of farms, farm tenure and debt, social features—that have 
traditionally been found in texts on farm management, land economics, 
rural sociology, or agricultural economics. In respect to these subjects, there 
is some recent information, particularly on trends and tendencies of change, 
but less new analysis and virtually nothing new for the specialist who has 
kept informed. 

The chapter on “Agriculture in the Total Economic Process,” a significant 
subject of growing interest and on which the 1950 Census obtained relevant 
information, occupies 64 pages that innocently discuss the percentage of 
population on farms, the percentage of national income that originates in 
agriculture, the farmer’s share of the consumer’s dollar, the expenditures of 
farmers for production goods and concludes that a farm is a biologics! manu- 
facturing unit that is interdependent with other industries. Curiously, input- 
output relations are not treated under this heading but in a later chapter 
called Group Interests in Agriculture, and there only to the extent of a short 
abstract from Leontief’s Scientific American article. In this context, the 
author employs input-output not to illuminate intricate economic relations 
but to help convince one (actually, Adam Smith no less) that the twentieth 
century is very different from the eighteenth and that in consequence “he 
would begin to understand why a labor movement and a farmers’ movement 
have arisen to provide countervailing power for these groups.” 

Again, in a chapter on “Farm Tenure and Debt,” the author picks up, then 
drops in two sentences, the subject of contractual production. Although this 
is a direction of rapid and significant expansion with far-reaching implications 
for production, processing, and marketing, we are inconclusively advised in 
the second of these two sentences that: “Contracts take many forms; the 
processing company gues so far in some instances as to prepare the land, 
supply the seed, and harvest the crop.” 

Another aspect of “Agriculture in the Total Economic Process” that con- 
cerns both the specialist and the nonspecialist is incomes and income source 
relations. Aspects of this subject are picked up in several chapters but no- 
where definitely or unambiguously discussed. In Chapter 2, percentages of 
population on farms are set up against percentages of national income from 
agriculture but with no explanation that farm population and the popula- 
tion engaged in agriculture are not identities. In Chapter 3, we are told, 
strangely, that “hired labor is paid for as in other occupations.” In Chapter 4, 
the text says (p. 55) that earning money from agriculture is the primary in- 
terest of all commercial farm cperators whereas the footnote says it is not; 
more lamentable is the fact that the 1950 Census endeavored to collect in- 
formation that would throw some light on the complexity of occupational 
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relations between agriculture and nonagriculture, of which the author makes 
virtually no use. Chapter 9 again picks up the subject of incomes and their 
magnitudes by classes of farms, and although nonfarm income received by 
farm families is included here, its interoccupational significance is lost in the 
emphasis on income differences within agriculture. 

The book unfortunately is well supplied with evident ccntradictions and 
non sequiturs. In Chapter 1, some soliloquial-like sentences wondering 
whether what has happened in agriculture in the past hai’ -entury should be 
called “change,” “trend,” or “development” lead abruptiy to invoking the 
authority of T. W. Schultz to pronounce there is no received theory explain- 
ing economic development. Similarly, a series of sentences (pp. 24-25) which 
declare that modern day technology decreases the rigidity of our bond with 
the soil terminate with: “In the Western World the fear of Malthusianism 
recedes into the background.” Those who pay attention to what they have 
read from one paragraph to another will have to resolve for themselves on 
such matters as whether hybrid seed corn is expensive or inexpensive (pp. 
14, 24), whether man has attained control of his physical environment or 
not (p. 41), whether or not the “agricultural ladder” really exists and if so 
can it be climbed (p. 95), whether tenancy is good or bad (pp. 96, 101). 

A book with so pretentious a title and so well sponsored ought either to be 
a useful monographic source for the specialist or an enlightening and un- 
ambiguous descriptive survey for the general reader and the undergraduate 
student. This is neither. 


Factor Analysis. An Introduction and Manual for the Psychologist and Social 
Scientist. Raymond B. Cattell. New York: Harper and Brothers, 1952. Pp. xiii, 
462. $6.00. 


Hersert Soiomon, Teachers College, Columbia University 


N THE preface Cattell states that this book has been written to meet three 

princ.pal requirements. “First it sets out to meet the need of the general 
student in science to gain some idea of what factor analysis is about and to 
understand how it integrates with scientific methods and concepts generally. 
Second, it is intended as a textbook for statistics courses which deal with 
factor analysis for the first time, either as an appreciable part or as the whole 
of the semester course. The third objective is to supply a handbook for the 
research worker, the student, and the statistical clerk which will be a prac- 
tical guide with respect to carrying out the processes most frequently in 
use.” The book is divided into three sections, each designed to treat one of 
the three aims. On the whole, Cattell does very well by his goals and makes 
a valuable contribution to the literature of factor analysis. 

The book is written wholly in the framework of the multiple factor struc- 
ture; a model which states that a response is a linear function of factors. 
Where the book treats this model as a research technique applicable to many 
different disciplines, much more generai use can be made of it than where it 
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treats the model as actually representing a behavioral pattern leading to 
observed psychological responses. The statistician can read this book without 
too much concern for the psychological implications of the model. He can 
borrow many of its procedures for use in other disciplines, but the statistician 
should be aware that, as the reviewer wholeheartedly feels and the author 
states in his preface, “Factor analysis requires the skills of an art as well as 
statistical principles.” 

Thurstone’s centroid method of estimating parameters in the multiple 
factor structure is featured prominently in the text. Very little space is de- 
voted to Hotelling’s principal components method of estimating the param- 
eters, and this is to be expected if the “simple structure” concept is to be 
applied after factorization. However, some psychology groups are still 
interested in principal component estimates, and it may prove useful in 
other disciplines. There may even be a use for principal component analysis 
under the assumption of the “simple structure” criterion if the criterion is 
applied before factorization rather than in concert with the factorization. 

Within the framework the author has set for himself, he writes lucidly, 
interestingly, and intelligently. Cattell gives a nice development of “alpha- 
bet soup” factor analysis; that is, the labelling of a factorization by the man- 
ner in which the elements of the initial correlation matrix are computed. 
For example: in R-technique, traits are correlated; in Q-technique, persons 
are correlated; in P-technique, traits on one person are correlated; in O- 
technique, correlations refer to correlations of occasions for a single person. 

This book will not be as appropriate for the uninitiated student as the 
author thinks it is; nor will it be the manual for the statistical clerk that the 
author has strived to make it; but it is a valuable addition to the list of text- 
books for courses in factor analysis and to the list of reference books for 
serious researchers. 


The Population of Yugoslavia. Paul F. Myers and Arthur A. Campbell. Inter- 
national Population Statistics Reports, Series P. 90 no. 5. U. S. Bureau of the 
Census. U. S. Government Printing Office, 1954. Pp. vi, 161. $1.00. 


Mio’ Macura, Yugoslav Federal Statistical Office 


ye book appertains to the series of reports devoted to the population of 
foreign countries published by the Bureau of the Census. Apart from 28 
tables and 16 charts, the report contains 112 pages of text and 6 appendixes. 

To this reviewer the tables are of the greatest interest. Most of them (24) 
are well selected from the prewar and postwar Yugoslav publications bearing 
on population census figures, vital statistics, school statistics and labor statis- 
tics. International statistics are represented by two tables, whereas the 
tables “Abridged life tables for males and females 1950” and “Estimates and 
projections of the population of Yugoslavia 1953-1970” have been prepared 
by the authors and their collaborators. It should be stressed that the con; 
version of data to actual boundaries and states of Yugoslavia exacted & 
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comprehensive and meticulous work, successfully published now. In doing 
so, the authors encountered certain discrepancies in methods and procedures 
successfully overcome in general.? In working out the population projection 
1953-1970, the authors had to prepare the abridged life tables. It was not 
possible to ascertain whether they made use of the Yugoslav life tables* for 
the prewar period. The future number of the Yugoslav population computed 
on the basis of four assumptions (amounting from 20,983,000 to 22,987,000 
of inhabitants for the year 1970), is discussed in the last chapter of the text, 
“Prospects for population growth.” Appendix F gives pertinent technical 
detaiis. The reader gets the impression that this chapter is a most successful 
one as regards the methods and carefulness applied. 

The first eight chapters of the text deal with: “Summary,” “Introduction,” 
“Population development,” “Distribution of the population,” “Demographic 
and social characteristics of the population,” “Education,” “Economically 
active population,” and “Differential fertility.” Apart from the interpretation 
of statistical facts, these chapters give surveys of historical, social, economic, 
cultural, political, etc., factors and institutions. 

It is very difficult for an author without deep knowledge of conditions, in- 
stitutions and language of a country to enter into such considerations with 
the purpose of giving meritorious opinions; and the suthors of The Popula- 
tion of Yugoslavia faced just such a difficult assignment. Jt should be stressed 
and appreciated that they consulted some eighty sources, but less than 
twenty were Yugoslav ones. Nevertheless, the sources were insufficient for 
such a study, particularly owing to a very small number of Yugoslav original 
studies published in English.‘ The authors were not sufficiently aware of the 
influence exerted by the historical factor upon the shaping of the population 
of different regions of Yugoslavia. A very slow economic development and an 
agrarian overpopulation have been appreciated in rather cursory a manner 
and without necessary consideration of all the conditions of developmeni. 
The postwar efforts made in the country’s industrialization and economic 
progress are valued postively. However, this reviewer feels that the authors 
did not comprehend the whole complex of the economic problems of coun- 
tries not advanced industrially. 

The Yugoslav reader will ask himself how was it possible that the authors 
so far underestimated war losses of the population and overestimated the 
emigration; he will be particularly surprised at the overestimation of Ger- 





1 Two appendixes are devoted to it: A. Former Italian territory ceded to Yugoslavia in 1947, and 
B. Relationship between the boundaries of the historical provinces, banovinas, and the present republics 
in Yugoslavia. 

2 See appendix C. Population Censuses of Yugoslavia, D. Differences between census and other 
population data, and F. Errors in reporting age. 

* Lah-Zorga: Tabele za finanéno in aktuarsko matematiko (Tables for financial and actuarial 
mathematics). Ljubljana, Dréavna salo%ba Slovenije, 1947. 

4 In my opinion, some of smaller errors are not serious, as for instance, the first natality statistics 
were available from 1890 (p. 3), that Istra forms a part of Slovenia (p. 10), that Germans attacked 
Yugoslavia on April 1st,.1941 (p. 11), that natality and mortality rates are established in 1891 only 
(p. 17), in appraising internal migration (p. 38), in appraising a high sex ratio (p. 42), ete. 
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man minority emigration and at the underestimation of the mortality of 
the same minority taking part in the Nazi armed forces occupying Yugo- 
slavia. The Yugoslav reader wonders also at the doubts expressed by the 
authors in evaluating the accuracy of the answers regarding religious 
conviction in the 1953 Population Census; and this the more so because the 
needed information was, or could have been made, available to the authors.§ 
Remarks of similar kind may be made also with regard to the passages 
setting out in an inadequate manner the course of the recent history of 
Yugoslavia, etc. 

Nevertheless, these critical remarks cannot diminish the value of the 
publication. As the first publication on the Yugoslav population appearing in 
English, it deserves full attention.® It contains needed information on essen- 
tial problems of the development and condition of the Yugoslav population. 
As Yugoslavia is now in the process of rapid industrialization and general 
development, such a book is of interest not only as containing bare informa- 
tion, but as a basic text-book for forthcoming researches as well. In this sense, 
tribute is to be paid to the Editor’s initiative and to conscious ard assiduous 
efforts of the authors. 


Food Consumption of Urban Families in the United States, with an appraisal of 
methods of analysis. Agriculture Information Bulletin No. 132. Faith Clark, 
Janet Murray, Gertrude S. Weiss, Evelyn Grossman, U. S. Department of Agri- 
culture, Agricultural Research Service, Home Economics Research Branch. 


Doris P. Rorawetu, U.S. Bureau of Labor Statistics 


y poe 200 page bulletin is a useful addition to the library of reference 
materials on family consumption. It presents results and also describes 
methods for their analysis and is well organized for the user of the data. Over 
half the volume consists of standard statistical tables presenting basic data 
on urban family food consumption for groups and items, during the period 
1948-49. These tables alone would make this a valuable publication. It 
brings together in one volume results of the nation-wide survey of urban 
housekeeping families in the spring of 1948, surveys of housekeeping families 
in 4 cities (Birmingham, Ala.; Buffalo, N.Y.; Minneapolis-St. Paul, Minn.; 
and San Francisco, Calif.) in the winter of 1948, and seasonal surveys in 





® It is characteristic, that the quality control of last census data omitted the verifying of the 
answers to the question on religion. It was, namely, considered that such 4 question bears an eminently 
subjective character, so that it might not be submitted to any control. Zarkovié, S.S., Population Census 
errors, Beograd 1954; P. Depoid, Rapport sur le degré de précision des statistiques démographiques, 
International Statistical Conferences, Rio de Janeiro, 1955. 

6 It should be stressed that the Federal Statistical Office did not publish such a synthetic publication 
concerning Yugoslav population. Its ten volumes of the 1948 Population Census results contain ten 
analyses of particular population characteristics. In doing #0, the Federal Statistical Office bore in mind 
the fact that the 1948 Population Census, giving first postwar results, was a short census only. By its 
nature, such a census was not in a position to furnish all the information needed to make a detailed 
population picture of Yugoslavia. Such a survey is to be expected after the publication of all the 1953 
Population Census resulta. 
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these 4 cities for additional periods during 1948-49. There is also a compari- 
son of results with a similar survey in 1942. 

In the text material the authors have made an effective separation of the 
technical details from the analytical summary presented in Part I. The sum. 
mary is not extensive. However it contains a good discussion of important 
family characteristics associated with variation in consumption. Its analysis 
of changes in family food consumption from 1942 to 1948 also is interesting, 

Part II, “Some Problems and Methods of Analyzing Family Food Data,” 
contains an excellent discussion of the limitations of the data and the pit- 
falls for the uninitiated analyst. It points up the interrelationships of factors 
affecting consumption and describes various techniques for their segrega- 
tion, including some illustrative examples, particularly for milk. It contains a 
good discussion of alternative methods of adjusting for differences in family 
size. Particularly noteworthy is the insertion of nontechnical descriptions for 
the technical terms used. For example, referring to income elasticity, the 
report states “income elasticity—a term for the ratio of rates of change in 
the consumption of an item and in income—is an indication of the order of 
urgency or degree of preference in consumption,” and again “income elas- 
ticity may be defined as the relative change in quantity consumed (or in 
expenditures) divided by the relative change in income, other things being 
equal.” The way in which consumption data by income group is adjusted for 
family size is very simply explained (p. 40). Only in the discussion of the 
more complex regression and correlation technique for determining factors 
affecting consumption (pp. 29-31) were the writers apparently unable to 
provide nontechnical explanations. 

Part II also contains useful analytical statistics, such as tables of income 
elasticities and coefficients of variation for selected foods. 

At the end of Part II is included a brief statement on seasonal indexes of 
food consumption. Because of the dearth of information in this field, the 
data shown are valuable as general indicators. However they must b ‘sed 
with caution since they were based on limited data and required impu ant 
adjustments and assumptions. The report states clearly that they represent a 
single year, 1948, but the term “seasonal indexes” appears to imply a greater 
degree of stability from year to year than was intended. 

Appendix B of the bulletin, “Methods Used in Collecting the Data,” con- 
tains an honest and complete accounting of the survey methods. Included 
also is a fairly comprehensive appraisal of the validity of the data. Unfor- 
tunately the fine print makes difficult reading of this valuable portion of the 
report. 

The sampling methods used deserve some comment. The city sampling 
design provided for stratification of all urban places by geographic region 
and size of city. An extra restriction (within a small tolerance) of State dis- 
tribution according to 1947 population was imposed, and random samples 
were redrawn for each stratum until the proper distribution was obtained. 
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The report states that 4 to 50 drawings were required. In the block sampling 
for the 32 largest cities, restrictions on the average number of dwelling units 
and average rental value based on the 1940 Census were imposed, which re- 
quired one to six drawings. Statistically, another method (possibly better) 
would have been to build such restrictions into the sampling design through 
further stratification. 

This sampling procedure appears to be akin to “balancing” discussed in 
some detail by Yates in his book on sampling methods,! in which the sample 
selection is such that the average value of the sample units for some charac- 
teristic is equal to the known average for the universe. In other respects the 
sample is equivalent to a random sample. Under Yates’ method, balancing 
would be effected through replacement of individual units by random selection 
rather than by complete redrawing of a cell, as was done by the Department 
of Agriculture. 

According to Yates, balancing by stratum should only be used when a 
“moderately large” number of units is selected from each stratum. Other- 
wise he says, “undue restrictions will be placed on the sample which will result 
in the selection of a sample which is not otherwise equivalent to a random 
sample.” The point is important since in some of their analysis the authors 
of this bulletin have treated the sample as if it were random. As an alter- 
native to stratification, the procedure would be effective only if the values 
under investigation (in this case, family expenditures for food) are roughly 
proportional to the differences in the known characteristic. Otherwise Yates 
says, stratification. would be preferable. 

The appraisal section of the bulletin considers the representativeness of 
the sample by comparison with Census data. As would be expected in view 
of the basis for stratification, data on characteristics of households in the 
sample agreed closely with Census data. The report then discusses the sam- 
pling errors of the data on the assumption of a random sampling procedure. 
It contains an enlightening table of sampling reliability for various foods 
by income group. Next, the data were reviewed for consistency and for con- 
formance to established consumption patterns. Finally a comparison was 
made with national estimates of the Department of Agriculture on per capita 
consumption. On all these counts, the sample appeared to be fairly reliable. 

Throughout Part II and Appendix B the writers are careful to point out 
limitations of the data, but in addition they provide their own overall evalu- 
ation of their importance. For sophisticated users, parts of this will seem 
obvious, but for users not accustomed to analysis of consumer expenditure 
data, this section will be valuable. 





1 Yates, Frank. Sampling Methods for Censuses and Surveys, 1949. Also “Review on Recent Statisti- 
cal Developments in Sampling and Sampling Surveys,” Journal of the Royal Statistical Society, Vol. 109, 
1946. 











1414 AMERICAN STATISTICAL ASSOCIATION JOURNAL, DECEMBER 1955 


Life Insurance Ownership Among American Families, 1954, Survey Research 
Center. Ann Arbor: Institute for Social Research, 1955. Pp. iv, 64. Free. 


Rosert Ferser, University of Illinois 


HIS monograph is another in the series of annual reports begun in 1948 
which the Survey Research Center has prepared on life insurance owner. 
ship based on data compiled in its annual Survey of Consumer Finances, 
One question was asked on life insurance in the 1954 Survey, namely: “Do 
you carry life insurance?” followed by subqueries on premiums paid in 1953, 
and any coverage at all during 1953 if no premiums paid. The organization 
of the report is built around the answers to this question, one chapter com. 
paring trends in life insurance ownership from 1949 to 1953, two others the 
relation of demographic and economic factors to insurance ownership, and 
a final chapter to a cursory exploration by means of cross-section tabulations 
of the interrelations between insurance ownership and these other variables, 
Like its predecessors in this series, much useful and interesting material 
on life insurance ownership is provided by this monograph. Many people 
will undoubtedly deplore the absence of information on the types of in- 
surance policies held, amount of coverage, and other characteristics, but 
considering the length of the original schedule and the subsidiary status of 
the general subject in the schedule, one cannot reasonably ask for additional 
questions on life insurance ownership. Within the framework of the present 
report, however, a number of questions can be raised regarding the organ- 
ization and analysis of the material. Briefly enumerated, they are: 

1. The basic analytical approach was to consider first and at some length 
the bivariate relations between life insurance ownership and various demo- 
graphic and economic variables and then in a final brief chapter the effect 
on these relations of the inter“ependence between variables. An attempt is 
made to eliminate the interacting influence of income, but the result is 
nevertheless that the reader is told only at the end that certain relationships 
demonstrated to exist by means of two-variable cross-tabulations are in fact 
due to interacting effects of other variables. This is what may be termed the 
“orthodox” approach in such cases—to consider two-variable correlations, 
then three variables, then those of higher order—but its desirability seems 
to this reviewer questionable in a study such as this one. Not only does it 
tend to confuse statistical laymen, who probably constitute the bulk of the 
study’s audience, but much of the preliminary material turns out to ve at 
best unnecessary and at worst misleading. More desirable would be an ap- 
proach that begins by explaining simply and briefly the problem of intev- 
actions, proceeds to examine the interactions in some detail and to derive 
the “true” relationships, and finally to consider the significance or implica- 
tions of the findings. 

2. Some attention might have been given to the type of audience for whom 
this monograph was intended. If intended mainly for statistical laymen, the 
presentation could have been improved considerably by using charts and 
other pictorial representation (which is completely absent) and relegating 
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many of the text tables to an appendix. If intended for statisticians, more 
powerful analytical methods might have been used, if only for supplementary 
purposes, instead of placing complete reliance upon examination of cross- 
tabulations. And if intended for both groups, a merger of the two preceding 
suggestions would not be at all difficult. 

3. Sadly lacking is a summary of the main results, preferably at the 
beginning of the monograph. There is, however, a two-sentence summary at 
the end of the last chapter which neatly presents the gist of the study: 

The analysis presented in this report has shown that the two factors which 
are most important in determining life insurance coverage and premium pay- 
ments are income and age. Other factors which are associated with insurance 
ownership and premiums are type of community, occupation, veteran status, 
liquid asset holdings, and payments to retirement and pension funds. 


Unfortunately, however, this summary is so submerged in the text that only 
the rare person who reads the study from cover to cover is likely to stumble 
over it. 

4, Hypotheses are conjectured here and there to account for certain ob- 
served relationships and left as such when a few extra cross-tabulations might 
have provided conclusive answers. Thus, on p. 49 it is conjectured that most 
spending units making no social security payments and not insured are 
headed by retired persons. Since occupation-of-head data were obtained, 
one further cross-tabulation would have tested this conjecture. 

5. Might the question on which this report is based be conducive to under- 
estimation of life insurance ownership? Spending units were classified as 
“insured” if premiums were paid on at least one member. Unless special care 
is taken in the interview, however, the “you” in the basic question “Do you 
carry life insurance?” might easily be interpreted by the respondent as re- 
ferring to himself alone. 

6. Some more attention to organizational details would not have been 
inappropriate. Thus, the organization of Chapter III is not as described 
earlier; there is some repetition of material (e.g., pp. 40-42) with preceding 
discussions of income effects) ; and the date given in the title should be 1953, 
not 1954 (interviews were conducted in early 1954 but the questions per- 
tained to 1953). 

Although these comments are largely on the critical side, most of them are 
relatively minor and by no means detract from the basic value of this series 
of studies or from the effort that has gone into it. It is to be hoped that it will 
be continued in the future and with a broader scope. 


The Study of Behavior: Q-Technique and Its Methodology. William Stephen- 
son. Chicago: University of Chicago Press, 1953. Pp. ix, 376. $7.50. 


Hersert Soitomon, Columbia University 


Fo some time now in psychology there has been discussion of ways in 
which sample values can be generated to represent observations from a 
population of values relating to only one individual. In general, classical 
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statistical methodology is constructed for inferences about groups of indi- 
viduals or inter-individual variability, and one has been furced to use these 
techniques in situations where inferences about an individual based on 
intra-individual statistics are really desired. One possibility which suggests 
itself is the gathering of data on a single individual over time. While this 
limits the type of study, it is certainly one way of generating observations 
whose analysis can lead to inferences about a single individual. Moreover 
there is a methodology for time series analysis which can be brought to 
bear on the data. In this book, Stephenson’s main contribution is a measure- 
ment technique by which one obtains sample observations relating to 
single individual in one time period. It is easy to see that in many situations 
this is more desirable than the collection of data over several time periods, 
and that even where the inclusion of the time factor is important, Stephen- 
son’s technique can still be used. 

The technique is a forced choice method in which a series of item state- 
ments are sorted by the individual along a continuum which has an ap- 
proximately normal distribution. Before the sort these statements are 
classified in terms of psychological variables. For example, in a study of 
selective service, the statement, “Although it is not a pleasant outlook, I am 
young enough to continue later if I am called” is supposed to represent a 
reasonable (as opposed to emotional or doubtful) attitude and a personal 
(as opposed to impersonal or doubtful) orientation. An emotional impersonal 
statement would be, “Why in the name of heaven cannot the ignorami in 
Washington make lucid and precise decisions with regard to the selective 
service problem without changing their minds every month?” A whole series 
of these statements, usually five or ten for each psychological situation 
covered in the study and 70-150 in total size, are placed on cards, shuffled 
thoroughly, and then given to the subject to place on a scale, in accordance 
with their significance to him, with, for example, the following frequencies: 


Most Significant Least Significant 
Score 100 9 8 7 6 5 4 3 2 1 O 
Frequency 2 3 5 8 12 12 12 8 56 3 2 


The cards must be sorted by the individual to give exactly the specified 
frequency. The scores are then analyzed, by standard statistical techniques, 
in terms of the pre-experimental classification of the item statements and 
the ceils in which they are placed by the subject. In the study of selective 
service, there were three levels of “attitude” and three levels of “orientation,” 
and the resulting data were analyzed formally by a 3 by 3 analysis of variance 
technique with replications. In addition, correlations between individuals— 
pairing scores on the same statement—car be calculated, or correlation 
matrices for groups of individuals can be subjected to factor analysis. The 
method may also be applied to complicated situations by introducing several 
levels of different variables and arranging the experiment in high order 
factorial designs, balanced incomplete blocks, or similar configurations. 
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It is through the factor analysis framework that the subtitle of the book 
derives its meaning. The jump off point in factor analysis is a correlation 
matrix. Various capital letters (R, Q, O, P, 8, T) have been assigned by 
psychologists to differentiate factorizations by the manner in which the 
elements of the correlation matrix are derived. The classical factorization 
which is based on correlations between traits is denoted by R. Factorization 
based on correlations between persons is denoted by Q. However, one must 
be wary and recognize that Q factor analysis is different from Stephenson’s 
Q-technique, although the author employs the term Q-technique for a col- 
lection of procedures designed to lead to inferences on the individual. 

In the latter half of his book, Stephenson discusses the use of his Q-tech- 
niques in several areas of psychology: type psychology, questionnaire analy- 
sis, social psychology, self-psychology, personality, projective tests, and 
clinical psychology, devoting one chapter to each area. 

Statisticians will find this book hard going; the descriptions of the tech- 
nique are often vague and poorly organized, and the style discursive and 
flowery. Also, statisticians need not involve themselves in the controversies 
between the author and several author-selected prominent factor analysts, 
but even the casual reader will admire the manner in which the author does 
not lose one engagement. The justifications of the technique as a device for 
measuring behavior of an individual are, at best, intuitive and heuristic. 
Certainly both experimental and statistical problems will arise in the initial 
classification of the statements before they are sorted by the individual in 
the experiments. In addition, several statistical problems are raised. The 
assumptions of independence and normality of residuals required by the 
standard analysis of variance are not necessarily satisf.cd by the manner in 
which the data are generated. It may turn out that this is not too critical, 
for, as is so often true, selection rather than hypot’.. sis testing procedures 
are really desired. 

All in all, Stephenson’s procedures represent an interesting development 
in psychological measurement which will require much more theoretical and 
experimental research for their ultimate appraisal. However, the author 
certainly does not provide a solid basis for his suggestion of carte blanche 
usage of the Q-technique in many areas of psychology. The uninitiated and 
the unsophisticated in statistics and measurement will do well to proceed 
with caution in the employment of Stephenson’s techniques in a psycho- 


logical inquiry. 


Share Ownership in the United States. Lewis H. Kimmel. Washington: The 
Brookings Institution, 1952. Pp. xi, 140. $1.50. 


Tuomas R. ATKinson, Federal Reserve Bank of Atlanta 


oME of the more glowing preliminary press accounts of the Brookings- 
New York Stock Exchange investigation of stock ownership in late 1951 
seemed to suggest that the study was designed to prove that nearly everyone 
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owns corporate stock. If there is even a grain of truth in this suggestion the 
published study demonstrates the valuable service that an independent 
foundation performs in providing technical supervision and assuring sciep- 
tific impartiality in the business of politico-economic fact-finding. It js 
quite probable that the Kimmel study will be a definitive work on stock 
ownership in mid-century. 

How many people own corporate stocks? This is the major question 
attacked by the author, and his approach shows considerable ingenuity. 4 
canvass of corporations open to public ownership provided an estimate of 
the number of shareholdings, which figure could then be deflated by the 
average number of separate issues owned by stockholders as determined by a 
survey of individuals and family units. The answer obtained by this method 
of computation—7.2 million individual owners or 4.6 per cent of the popu- 
lation—lies within the .05 confidence interval of 4.2 per cent of the popula- 
tion, the direct estimate obtained by probability sampling of the spending 
unit universe. The degree of correspondence of the two estimates lends 
considerable credence to the results; and for the first time in the area of 
counting stock-owners, there seem to be no vast holes in the statistics. 

Although the above problem is the raison d’etre of the study, most of the 
text is devoted to detailing the characteristics of shareholdings and share- 
holders. Men and women are about equally frequent as holders of publicly 
owned stock issues; the Middle Atlantic States rank first in number of 
shareholders among geographical divisions; and most holdings consist of 
less than 100 shares of any single issue. This much is learned from the 
analysis of shareholdings. Turning to the survey of shareholders, people 
aged 50 to 59, people with 4 years or more of college, people in the higher 
income grou 9s and in executive and professional work are most frequent 
stock owners. All of these data and more will be of interest and possible use 
to the investment profession and to teachers in the field. Kimmel adds to the 
usefulness of the data by untangling the web of nominee ownership, a 
problem that has baffled previous students of the subject. 

To the sociologist, historian, and economist the emphasis on number of 
shares and number of shareholders may seem less desirable than an analysis 
based on dollar value of holdings. While the author presents some data on 
the dollar value of holdings, the data are only for his basic sample of cor- 
porate shareholdings and he does not attempt an estimate of dollar value of 
all publicly owned stock by type or characteristics of owner. One is left 
to wonder, for example, whether fiduciaries do own 16 per cent of the total 
value of stock of all publicly owned corporations, the percentage shown 
for Kimmel’s reporting firms. Furthermore, no attempt is made to determine 
the distribution of dollar value of holdings by income groups, an omission 
of some consequence to those interested in the distribution of income and 
wealth. 

Current interest in this study will probably center around what it tells of 
nature and consequences of the bull market in corporate equities and the 
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ensuing crash in capital values in the fall of 1955, even though the estimates 
were made four years ago. In particular, Galbraith’s thesis that the 1929 
market crash played an important part in causing the depression of the 
1930’s by striking at the spending and investment activities of the wealthy 
should send analysts of current conditions searching for the facts on stock 
ownership. This study will be a good place to start that search. If Brookings 
and the New York Stock Exchange could be persuaded to attempt a similar 
study of facts of stock ownership as of the fall of 1955, much would be added 
to our knowledge of the dynamics of the investment process. 


Factors Influencing Consumption: An Experimental Analysis of Shoe Buying. 
Technical Paper 10. Ruth P. Mack. New York: National Bureau of Economic 
Research, 1954. Pp. 124. $2.00. Paper. 


Guy H. Orcutt, Harvard University 


HIS monograph attempts to outline the factors responsible for changes 
_ ate time in aggregate shoe buying by consumers in the United States 
in the interwar period. As reported by the author this study was undertaken 
in connection with her forthcoming book, Consumption and Business Cycles, 
A Case Study: The Shoe, Leather, Hide Industry, in which, in briefer version, 
the material presented in this monograph will constitute a chapter. 

All serious students of consumer behavior will wish to read this mono- 
graph, and its pleasant style and careful analysis will certainly lead them 
to study the author’s forthcoming book. 

Primarily on the basis of aggregate monthly time series relating to the 
period 1929-1941, the author investigates the relation between shoe sales 
and a number of variables including personal income, shoe prices, change in 
disposable income, time, and estimates of consumer stocks. Some considera- 
tion is also given to the possible influence of changes in the distribution of 
income and to population: its size, age, and family structure. A number of 
these variables are considered one at a time in relation to shoe sales, and in 
doing this various auxiliary pieces of information from budget studies and 
elsewhere are utilized. Then single equation, least square, multiple regression 
methods are used to investigate the multi-variate relation of various com- 
binations of the above variables to various variants of shoe sales. 

The variables which show up as being significantly related to shoe sales are 
personal income, trend, and shoe prices relative to the Cost of Living Index. 
Of these three, hotvever, the evidence supporting the relevancy of the third 
is very restricted; and the second, as the author points out, represents the 
combined result of a number of variables which could not be expected to 
have the effect of a linear trend for any extensive period outside of 1929- 
1941. 

The failure of this study to establish much, if any, role for any variables 
except income and time is not regarded by the author, or by this reviewer, as 
very substantial evidence that there are no other variables which play a 
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significant role in determining shoe sales. All the usual problems of effec. 
tively utilizing aggregate economic time series are present. These certainly 
include multicolinearity, autocorrelated time series, simultaneous relations 
and/or feedback, errors of observation, and small number of effective de. 
grees of freedom for testing hypotheses about anything but extremely short- 
run changes. 

The author makes a number of valuable suggestions on further study in 
this area, including the greater use of diverse evidence, such as, evidence 
from area surveys and psychological studies as well as from time series, 
In this the reviewer stronzly agrees and he would like to reinforce the 
author’s suggestion that vastly greater use be made of small area surveys in 
which an attempt is made to eliminate all but a small number of variables 
by selective sampling. Another possibility is that some gain in effective 
testing power could be achieved by working with data relating to Federal 
Reserve Districts or even states rather than with the national aggregates, 
It also seems possible to this reviewer that more effective use might be made 
of the fact that monthly data are available. 


Mathematical Thinking in the Social Sciences. Paul F. Lazarsfeld, Editor. Glencoe, 
Illinois: Free Press, 1954. Pp. 444. $10.00. 


M. H. Srone, University of Chicago 


_ present book is not an easy one to review. Whatever its title may 
suggest, it is actually a collection of individual articles dealing witha 
number of specific situations not by any means fully representative of the 
different modes of mathematical thinking in the social sciences. Thus the 
book has to be regarded as a potentially biased sampling of problems, meth- 
ods, personal attitudes, and individual styles. The introduction by Paul 
Lazarsfeld, the Editor, inclines to be a summary of what is in the book rather 
than to locate its contents in reference to the broad field indicated by its 
title. A list of the authors included in the collection and the titles of their 
contributions will therefore serve to give the best possible general idea of its 
nature. The volume consists of eight papers, together with the Editor’s 
introduction, as follows: (1) T. W. Anderson, “Time Changes in Attitudes”; 
(2) Nicholas Rashevsky, “Imitative Behavior; Distribution of Status”; 
(3) James S. Coleman, “Expository Analysis of Rashevsky”; (4) Jacob 
Marshak, “Probability in Social Science”; (5) Louis Guttman, “Principal 
Components of Scalable Attitudes”; (6) Louis Guttman, “The Radax”; 
(7) Paul F. Lazarsfeld, “Latent Structure Analysis”; (8) Herbert A. Simon, 
“Strategic Considerations in the Construction of Social Science Models.” 
The paper of Coleman is a presentation of Rashevsky’s material in non- 
mathematical language, with indications of additional fields of application 
for Rashevsky’s arialysis; and the papers of Guttman and Lazarsfeld are 
rather closely related discussions bearing on modern technical problems in 
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the general field of factor analysis. Otherwise the contributions are quite 
independent: they deal with different phenomena and, despite their common 
use of the basic elements of probabilistic reasoning, diverge in a variety of 
mathematical directions. For example, the two most mathematical papers, 
those of Rashevsky and Anderson, involve respectively the solution of 
unfamiliar functional equations and the handling of Markoff processes. The 
reader will thus find an interesting range of problems and ideas expounded 
by a list of distinguished authors; but he will not find, even in Simon’s 
essay, @ general discussion of mathematical methods in the social sciences 
or examples of such important methods as those of game theory, linear 
programming, classical mathematical economics, population theory, or 
“social physics.” 

Having tried to indicate thus succinctly what the book is and is not 
about, the reviewer can perhaps best perform his function, within the scope 
of his own competence, by recording some off-hand reactions of a pure 
mathematician to this collection of papers. If these reactions are taken as 
catalysts of thought and reflection, rather than as serious statements of a 
position, they may serve a useful purpose 

A striking feature common to the majority of the papers is a sensitive and 
tender concern for the nonmathematical reader. By it the writers are led 
away from the conciseness of good mathematica! presentation into labored 
disquisitions and involved circumlocutions intended to help or to placate the 
reader in question. If one were convinced that the end-product were il- 
luminating or persuasive, one would not be inclined to quarrel with this 
approach. However, the reviewer is inclined to think that the results are 
far more likely to be disappointing, because the disquisitions by themselves 
fail to convey what has to be conveyed and in some cases even wind up 
rather lamely by yielding the last word to a mathematical commentary 
which can only heighten the confusion of the nonmathematical reader. It 
seems to me that if I were ignorant of mathematics, I would be more thor- 
oughly persuaded of its effectiveness and value by a clear statement of 
quantitative or qualitative results arrived at by a frank and patently com- 
petent mathematical treatment than I would by a labored verbal analysis 
incapable on its face of producing any such specific results. If the attempt at 
nonmathematical exposition risks failure so far as the nonmathematical 
reader is concerned, it also risks discouraging the mathematically competent 
reader, who is more than likely to be put off by the task of slashing away 
the luxuriant verbiage beneath which the essentials are to be found. While 
there is indeed a place for discussing the pertinence of mathematical reason- 
ing to the task of the social sciences, it does not seem to the reviewer that 
the best place is in technical papers devoted to specific suvjects. The need for 
apology, evidently felt by most of the authors in the present book, is an 
interesting indication of the present state of the social sciences, and points 
very clearly to the importance of reorienting the preparation of future social 
scientists. 
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It is interesting and challenging to note that a good many of the mathe. 
matical formulations advanced in these papers lead to mathematical prob- 
lems which at present defy solution. This phenomenon is perhaps more 
marked than it is in the case of the physical sciences, though there also it js 
common enough. It is essential for the justification and future development 
of the use of mathematical methods in the social sciences that means should 
be found for solving both theoretically and computationally enough of such 
problems to permit a wide range of comparison between theory and obser. 
vation. In general the authors do not isolate these problems in a clear-cut 
way from the general discussions, thus leaving the mathematician who is 
tempted to try his hand at some of them to pick them out by his own efforts, 
A consequence of the mathematical obstacles encountered in a number of 
the papers is that they rest at a speculative level and have to be justified 
by argumentation rather than by comparison with empirical obser vations 
and results. This points out a marked contrast with what one would find in 
a similar series of papers on topics in physics or engineering: in the latter 
there would occur repeated comparisons between theoretical results (often 
arrived at by crude approximations to which recourse is had because of the 
genuine obstacles to an exact mathematical treatment) and observational 
data. No doubt as the art of mathematical analysis of social phenomena 
progresses authors will follow this well-established and desirable practice of 
the physical scientists. 
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